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ON REGULATE EXPRESSION OF NUCLEOTIDE SEQUENCES 

(57) Abstract: This invention relates to the use of a system in which expression of a reporter gene is indicative of the activity of 
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said promoter activity through expression of said reporter gene which resides within the nucleotide sequence being regulated by 
said promoter. Mutagenesis of cells which contain said reporter gene system allows isolation of upstream regulatory mutations or 
modifier mutations. 
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System and Method to Detect Compounds and Mutations that 
Upregulate and Downregulate Expression of Nucleotide Sequences 



This application claims priority to a U.S. provisional application 60/140,920 filed 
5 June 24, 1999. 

The work herein was supported by grants from the United States Government. 
The United States Government may have certain rights in the invention 

10 FIELD OF THE INVENTION 

This invention relates to the use of a reporter gene system to detect alterations in the 
activity of a promoter. More particularly it relates to the use of said reporter gene system 
to screen for mutations and compounds which upregulate or downregulate a particular 
15 nucleotide sequence expression. 

BACKGROUND OF THE INVENTION 

The molecular genetics of genomic imprinting can be studied utilizing the 
20 homologous regions of mouse chromosome 7C and human chromosome 15ql l-ql3 that 
include the genes responsible for Prader-Willi syndrome (PWS) and Angelman syndrome 
(AS) in the human. PWS and AS are distinct neurogenetic disorders caused by deficiency 
of paternal or maternal gene expression respectively from human chromosome 15ql l-ql3 
(Jiang et al., 1998; Nicholls et al., 1998). AS is caused by maternal deficiency for E6-AP 
25 ubiquitin-protein ligase (Malzac et al., 1998; Fang et al., 1999), but the genes causing PWS 
are not known. The human SNRPN and mouse Snrpn loci are within the PWS critical region 
and are expressed exclusively from the paternal chromosome and encode a small protein of 
unknown significance SNURF and small nuclear ribonucleoprotein-associated polypeptide 
N (SmN) (Ozpelik et al., 1992; Leff et al., 1992; Reed and Leff, 1994). The SNRPN/Snrpn 
30 loci in human and mouse are associated with a CpG island that encompasses the promoter 
and is heavily methylated exclusively on the maternal, silenced chromosome. Snrpn is 
known to be ubiquitously expressed in the mouse with abundant expression in brain and 
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heart and lower levels of expression in a wide range of tissues in both adults and developing 
embryos (Barr et al., 1995). All available evidence is consistent with very tight silencing 
of SNRPN/Snrpn on the maternal chromosome in human and mouse in all tissues and at all 
developmental stages studied to date (Barr et al., 1995). Furthermore, mice lacking Snrpn 
5 expression on the basis of maternal uniparental disomy show normal regulation of neuronal 
alternative splicing events (Huntriss et al., 1994). There is evidence based on disruption of 
SNRPN in two patients with PWS (Sun et al., 1996; Kuslich et al., 1999) that paternal 
deficiency of SNRPN may cause PWS, but there are also rare patients suggesting exclusion 
of SNRPN as a causative gene in PWS (Schulze et al., 1996; Conroy et al., 1997). However 

1 0 mice with maternal uniparental disomy (UPD) causing paternal deficiency for chromosome 
7C (Cattanach et al., 1992) or with an imprinting mutation causing paternal deficiency of 
Snrpn and the surrounding region (Yang et al., 1 998) demonstrate postnatal lethality that has 
been proposed to be the equivalent of the neonatal PWS phenotype, but mice with a 
heterozygosity or homozygosity for a null mutation for Snrpn do not show lethality or any 

15 other obvious phenotype (Yang et al., 1998). 

There is evidence for both cis elements and trans-acting factors in the regulation of 
imprinted gene expression (Surani, 1998; Reik and Walter, 1998). In particular, there is 
evidence for a bipartite imprinting center (IC) at the site of the SNRPNpromoter and further 
upstream in the human, with a more proximal region required for switching from maternal 

20 to paternal epigenotype and a more upstream region required for switching from paternal to 
maternal epigenotype (Dittrich et al., 1996). More generally, there are extensive data 
regarding the role of promoters, enhancers, and a genomic mark in the imprinted expression 
of HI 9 and Ig/2 in the mouse (Bartolomei and Tilghman, 1997). Numerous trans-acting 
factors must be required for the switching, establishment, and maintenance of genomic 

25 imprinting. For example, absence of the DNA methyltransferase in mice causes lethality 
and abnormalities of imprinting (Li et al., 1992; Li et al., 1993; Warnecke et al., 1998). The 
interaction of cis elements with trans-acting factors may establish a hierarchical control 
mechanism for the initiation and maintenance of genomic imprinting, and mutations might 
affect various aspects of this mechanism. In some instances, cis elements taken from 

30 imprinted regions in the mouse and human have been shown to have silencing effects in 
Drosophila (Lyko et al., 1997; Lyko et al., 1998). 
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By expressing coat color markers under the control of various promoters, this 
approach could provide a general strategy for screening for regulatory mutations using 
ethylnitrosourea (ENU) or insertional mutagenesis (Justice et al., 1997; Schimenti and 
Bucan, 1998; Woychik and Alagramam, 1998; Battey et al., 1999). Genetic screens based 
5 on easily visible markers in Drosophila have been the cornerstone for dissecting genetic and 
epigenetic (e.g., transfection and position-effect variegation) regulatory mechanisms. The 
strategy of genetic fusions as experimental tools is well established in bacteria (Slauch and 
Silhavy, 1991). 

Although about 100 genes in the mouse have been identified as affecting coat color 

10 (Barsh, 1996; Jackson, 1994), the agouti locus is particularly attractive for use in these 
studies because the genetics are well characterized, and phenotypic effects are known over 
a wide range of expression. The agouti gene on mouse chromosome 2 encodes a 1 3 1 amino 
acid secreted protein(Bultman et al., 1992; Miller et al., 1993) that is an antagonist of the 
melanocyte-stimulating-hormone (MSH) receptor (Lu et al., 1994). The agouti protein acts 

15 in a paracrine manner to inhibit stimulation of melanogenesis by aMSH and induces 
melanocytes to switch from producing black pigment (eumelanin) to yellow pigment 
(phaeomelanin) (Silvers, 1979). At least 19 dominant and recessive alleles have been 
identified at the agouti locus (Green, 1989). The most common alleles in laboratory mice 
are agouti (A), which is dominant and yields a brown coat color, and the recessive nonagouti 

20 (a) allele, which yields a black color in homozygotes. The agouti locus is also attractive for 
study because extremely low levels of expression yield a phenotypically significant change 
in coat color compared to homozygotes for the extreme nonagouti allele (a e ), a null mutation 
(Hustad et al., 1995). It is predicted that even low levels of expression of the protein encoded 
by the A allele might yield visible changes in coat color. 

25 In mice, agouti is expressed only in the skin during the time of phaeomelanin 

synthesis and not in brain, liver, kidney, or numerous other tissues (Bultman et al., 1992; 
Miller et al., 1993). However, several dominant mutations of the agouti locus or ectopic 
production controlled by a transgene with a heterologous promoter are associated with 
increased yellow pigment in the coat, obesity, diabetes, and susceptibility to tumors (Duhl 

30 et al., 1994a; Duhl et al., 1994b; Wolff et al., 1987). Furthermore, the ubiquitous 
overexpression of agouti under the control of the p-actin or Pgk-1 promoter in transgenic 
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mice also gives rise to yellow, obese mice (Klebig et al., 1995; Perry et al., 1995). Michaud 
et al. described a new dominant agouti allele designated A iapy (Michaud et al., 1994). This 
mutation arose through the spontaneous insertion of an intracistemal A-particle (TAP) 
proviral element 5 1 -bp upstream of the first coding exon of the agouti locus. These authors 
5 demonstrated that ectopic expression of the agouti gene arose through the utilization of a 
cryptic promoter within the 5' long terminal repeat (LTR) of the provirus. In addition, the 
methylation status of the 5' LTR was inversely correlated with the level ofA iapy expression; 
most importantly, methylation and expression varied with maternal versus paternal 
inheritance (Michaud et al., 1994). This nonphysiological "imprinting" of theA iapy allele 

1 0 provided a stimulus to explore the strategy of expressing the agouti cDNA under the control 
of an imprinted promoter. The coat color of mice with the A iapy allele ranges from solid 
yellow to pseudoagouti, the latter being similar to the A/ A genotype. Mice with high levels 
of ectopically expressed agouti mRNA show yellow coat color and marked obesity, whereas 
pseudoagouti mice express the agouti gene ectopically at low levels, and their body weights 

15 are similar to controls (Michaud et al., 1994) 

There is a general appreciation in the research community that better methods are 
needed for mutagenesis and genetic screening in mammalian organisms such as the mouse 
and that better methods are needed for screening for compounds that affect gene expression. 
As in the example of the Big Blue mouse system advocated by Stratagene (La Jolla, CA), 

20 these are very cumbersome systems by comparison to that which is proposed in this patent 
application. The intensity of scientific interest in the mouse is well summarized in a recent 
article entitled "An action plan for mouse genomics" (Battey et al., 1999). The intensity of 
scientific interest is also indicated by two RFA's from the NIH: HD-99-007 entitled "Mouse 
mutagenesis and phenotyping: Developmental defects" and MH-99-007 entitled "Mouse 

25 mutagenesis and phenotyping: Nervous system and behavior." In general, molecular genetic 
manipulations possible in the mouse can ordinarily be carried out in other mammalian 
species and lower organisms once the feasibility is demonstrated in the mouse. This patent 
application proposes a general method to screen for genetic effects and/or effects of various 
compounds on gene expression, both transcriptional and posttranscriptional. 

30 This invention demonstrates for the first time the use of knockin/fusions in higher 

eukaryotic organisms to detect compounds and mutations that upregulate and downregulate 
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expression of a specific gene. It is noteworthy that although similar methods in bacteria 
have existed for nearly a decade, the idea and use of a similar general strategy for such 
detection in mammals or other organisms has gone unrealized, suggesting that this invention 
is indeed nonobvious. 

5 

SUMMARY OF THE INVENTION 



An embodiment of the present invention is a system to determine elements that 
regulate expression of a nucleotide sequence comprising a reporter sequence inserted within 
1 0 said nucleotide sequence under conditions wherein the expression of the reporter sequence 
is controlled by the promoter for said nucleotide sequence. In specific embodiments, the 
reporter sequence can be inserted into the part of the sequence coding for the N-terminal 
portion of a protein, the C-terminal portion of a protein or the 3' untranslated region of the 
nucleotide sequence. 

15 In a preferred embodiment of the present invention two different reporter sequences 

are inserted within the nucleotide sequence and both reporter sequences are controlled by 
the promoter for the nucleotide sequence. 

In specific embodiments of the present invention the nucleotide sequence of the 
system is selected from the group of organisms consisting of mammals, non-mammalian 

20 animals, plants, insects, aquatic organisms and avian species. 

An additional embodiment includes a composition of matter wherein a functional 
system is inserted into the genome of a mammal, a non-mammalian animal, a plant, an 
insect, an aquatic organism, or an avian species. Additionally a functional system can be 
inserted into a cell. 

25 An additional embodiment of the present invention is the method of identifying an 

upregulator or downregulator of expression of a specific nucleotide sequence comprising the 
steps of inserting a test system into a test organism wherein the test system comprises a 
reporter sequence inserted into a specific nucleotide sequence under conditions wherein the 
expression of the reporter sequence is controlled by the promoter for the nucleotide 

30 sequence; delivering a compound to the test organism containing the test system; and 
measuring the effect of said compound on said reporter sequence wherein when said 
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compound increases the expression of said reporter sequence it is an upregulator and when 
said compound decreases the expression of said reporter sequence it is a downregulator. 

In a specific embodiment the method can be used to identify compounds that affect 
postranscriptional control at the RNA level when the reporter is inserted by homologous 
5 recombination and the resulting construct expresses a fusion transcript. 

In an additional specific embodiment, the method can be used to identify compounds 
that affect posttranscriptional control at the protein level by inserting the reporter sequence 
into the nucleotide sequence by homologous recombination and such construct expresses 
a fusion protein 

10 In specific embodiments, the method utilizes a test organism selected from the group 

consisting of a mammal, a non-mammalian animal, a plant, an insect, an aquatic organism, 
and an avian species . 

A further embodiment of the present invention is the method for identifying an 
upregulator or downregulator of expression of a specific nucleotide sequence comprising the 

15 steps of inserting a test system into a test organism, wherein the test system comprises a 
reporter sequence inserted within the specific nucleotide sequence under conditions wherein 
the expression of the reporter sequence is controlled by the promoter for the specific 
nucleotide sequence, mutagenizing the test organism containing the test system, and 
measuring the effect of said mutagenesis on expression of said reporter sequence wherein 

20 when the mutant resulting from said mutagenizing increases the expression of the reporter 
sequence, the mutant is an upregulator and when the mutant resulting from said 
mutagenizing decreases the expression of said reporter sequence, the mutant is a 
downregulator. 

In a specific embodiment, the method further comprises the steps of delivering a 
25 compound to the mutagenized test organism and measuring the effect of said compound on 
said reporter sequence, wherein when said compound increases the expression of said 
reporter sequence the compound is an upregulator and when said compound decreases the 
expression of said reporter sequence the compound is a downregulator. 

In a preferred embodiment, the method uses two different reporter sequences inserted 
30 within the nucleotide sequence and both are controlled by the same promoter. 
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A further embodiment of the present invention is the method of identifying an 
upregulator or a downregulator to treat a disease in which there is a deficiency in expression 
(upregulator) or excess in expression (downregulator) of a nucleotide sequence. 

An additional embodiment of the present invention is the method of treating a 
5 disease in an organism in which the disease is the result of a deficiency or excess in 
expression of a nucleotide sequence comprising the step of introducing a pharmacologically 
effective dose of an upregulator (deficiency) or downregulator (excess). 

A further embodiment of the present invention is the method to identify an 
upregulator or downregulator to enhance food content of an animal, aquatic organism, avian 
1 0 species, or plant. 

Other and further obj ects, features and advantages would be apparent and eventually 
more readily understood by reading the following specification and by reference to the 
company drawing forming a part thereof, or any examples of the presently preferred 
embodiments of the invention are given for the purpose of the disclosure. 

15 

DESCRIPTION OF THE DRAWINGS 



Figures la through lb show generation of agouti knockin mice by homologous 
recombination. Figure la illustrates the structure of the targeting vector and partial 

20 restriction map of the Snrpn wild type allele and agouti knockin allele after homologous 
recombination. Differentially methylated sites of Snrpn intron 1 detected by methylation 
sensitive enzymes ifrsHII and Sacll are shown below the line. The 5' and 3' flanking probes 
are indicated. Restriction enzymes and other designations are as follows: B, BssHIl; E, 
EcoRI; H, Hindlll; P, Pstl; S, Sail; X, Xbal; Sc, Sacll; Ag, agouti cDNA; pA, rabbit P- 

25 globin polyadenylation signal; Neo, neomycin expression cassette; TK, HSV-tk expression 
cassette; and Me probe, probe for methylation study presented in Figure 2. ATG 1 represents 
the potential start site for a small (SNURF) open reading frame in the Snrpn gene of 
uncertain significance. ATG 2 is the agouti start codon and is not in frame with ATG 1 , ATG 3 
is the start codon for the major (SmN) open reading frame for Snrpn. Figure lb is a Southern 

30 blot analysis of ES cell DNA and mouse tail DNA from heterozygous matings using 3' 
flanking probe. The Psil 9-kb fragment was detected in the wild type allele ( WT), and a Pstl 
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5.2-kb fragment was detected in the agouti knockin allele (Ag KI). Genotypes are indicated 
as wild type (+) or transgenic (T). 

Figures 2a through 2b show a methylation study of the Snrpn CpG island in three 
generations of mice carrying the Snrpnl agouti knockin/fusion sequence. Figure 2a depicts 
5 breeding of the mice with agouti knockin/fusion sequence expressed from Snrpn promoter. 
Symbols are as follows: squares, males; circles, females; checkered symbols, chimera; filled 
symbols, mice expressing transgene; symbols with central dots, mice with transgene but not 
expressed. Figure 2b shows genomic DNA from mice 1-1 , II-2, and III-2 indicated in panel 
a was digested with Hindlll (H) alone, Hindlll plus Sacll (Sc), or Hindlll plus fesHII (Bs), 

1 0 and analyzed by Southern blot hybridization. Probe for methylation analysis is indicated in 
Fig. la (Me probe). Detected fragment sizes are as follows: methylated allele of wild type 
Snrpn (WT), 14 kb; methylated allele of agouti knockin (Ag KI), 12 kb; unmethylated allele 
of wild type Snrpn (WT unme), 1 1 kb; unmethylated allele of agouti knockin (Ag KI unme), 
8.5 kb for Hindlll plus Sacll digestion, and 8 kb for Hindlll plus BssHII digestion. 

15 Figures 3a through 3b demonstrate the effect of the transgene on coat color. 

Photographs of mice with agouti transgene expressed from the endogenous Snrpn promoter 
on a nonagouti background. Figure 3a shows that when the agouti transgene was inherited 
from a heterozygous male, the offspring carrying the paternally inherited agouti transgene 
(T p /+) showed a distinctly lighter tannish coloration on their ventrum compared to 

20 littermates without the transgene (+/+). Figure 3b shows that when the agouti transgene was 
inherited from a heterozygous female, the offspring (T m /+) were entirely black and were 
indistinguishable from littermates not carrying the transgene. 

Figures 4a through 4b show expression of Snrpn and Snrpnl agouti fusion transcripts 
in the skin of the agouti mice with the knockin/fusion sequence during three generations of 

25 breeding. Figure 4a indicate partial structure of the Snrpn wild type and knockin/fusion 
sequence. Symbols are the same as in Figure la. Arrows above the boxes indicate 
transcriptional orientation of the Snrpn and neomycin genes. Arrowheads below the exons 
indicate primers used for RT-PCR analysis. Primer p3 was used for RT reaction of Snrpn 
RNA, primers pi and p2 were for PCR of the Snrpn cDNA. Primer p5 was used for RT 

30 reaction of Snrpnl agouti fusion RNA, and primers pi and p4 were for the subsequent PCR 
of the Snrpnl&gouti fusion cDNA. Figure 4b shows RT-PCR analysis was performed for 
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Snrpn, Snrpn/agouti fusion, neomycin, and an internal control gene, Hprt, using total RNA 
isolated from skin of mice 1-1, II-2, and III-l as indicated in the pedigree in Figure 2a. 
Lanes representing RNA incubated with reverse transcriptase are indicated as (+), while (-) 
lanes are reactions incubated without reverse transcriptase as controls for genomic DNA 
5 contamination. 

Figures 5 a through 5b illustrate the strategy for detecting dominant mutations 
affecting genomic imprinting. In Figure 5 a mutagenized males homozygous for the 
transgene and nonagouti a/a are bred to nontransgenic nonagouti a/a females. In Figure 5b 
mutagenized nontransgenic nonagouti a/a males are bred to females homozygous for the 
10 transgene and nonagouti a/a. 

Figure 6 illustrates a mutant mouse with tannish abdomen compared to nonmutant 
littermate. This mutant animal was discovered in the screen mating a mutagenized wild type 
male with a homozygous transgenic female. All offspring in this screen have a black 
abdomen except for the one mutant animal depicted. 
1 5 Figure 7 shows a Southern blot demonstrating lack of methylation of the maternally 

inherited knockin/flision gene in the mutant mouse. Southern blotting was performed using 
the same probe and restriction enzyme combination as for Figure 2. 

Figure 8 shows preparation of bicistronic agouti and GFP reporter under control of 
the Snrpn promoter. The agouti (Ag) cDNA was placed under the control of the Snrpn 
20 promoter as for the construct shown in Figure 1 . 

Figure 9 depicts generation of two fusion gene constructs expressing human a- 
fetoprotein under control of the promoter for the LDL receptor. 

25 DETAILED DESCRIPTION OF THE INVENTION 

It is readily apparent to one skilled in the art that various embodiments and 
modifications may be made to the invention disclosed in this Application without departing 
from the scope and spirit of the invention. 
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The term "aquatic organism" as used herein is defined as freshwater or saltwater 
organisms. Examples would be fish, shrimp, scallops, mussels and other fresh or salt water 
food organisms. 

The term "compound" means any protein, peptide, nucleic acid, DNA, RNA, 
5 oligonucleotide, hormone, sugar, lipid and other inorganic and organic chemical including 
metals or combinations thereof. 

The term "DNA" as used herein is defined as deoxyribonucleic acid. 

The term "downregulator" as used herein is defined as a compound or molecule 
which decreases expression of a particular nucleotide sequence. The decreased expression 
1 0 means less expression than the cell or organism is expressing without the downregulator. 
This means there is less expression than is present in the native cell, which can be measured 
before the administration of the downregulator. A downregulator can be any compound or 
mutant sequence. 

The term "elements" as used herein is defined as a compound, mutagen, or any other 
1 5 entity which causes an effect on regulation of expression of said nucleotide sequence. 

The term "expression" as used herein is defined as the transcription and/or 
translation of a particular nucleotide sequence driven by its promoter. 

The term "fusion gene" as used herein is used interchangeably with the term 
"knockin" or "knockin/fusion" and is defined as an insertion of a first nucleic acid into a 
20 second nucleic acid, wherein said insertion creates a novel DNA joint. Thus, the first and 
second nucleic acids are therein made contiguous, and in specific embodiments the 
translational and/or transcriptional signals which affect one, affect the other. In a preferred 
embodiment a first nucleic acid is an easily detectable reporter gene or bicistronic reporter 
cassette. In another preferred embodiment the second nucleic acid, is a nucleic acid of 
25 interest, the promoter of which, upon creation of the fusion gene, regulates expression of the 
reporter gene or bicistronic reporter cassette. In another preferred embodiment creation of 
the fusion gene occurs through homologous recombination. 

The term "homologous recombination" as used herein is defined as an exchange 
between any pair of DNA sequences having a similar sequence of nucleotides, where the 
30 two sequences interact (recombine) to form a new recombinant DNA species. The 
frequency of homologous recombination increases as the length of the shared nucleotide 
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DNA sequences increase. Homologous recombination can occur between two DNA 
sequences that are less than identical, but the recombination frequency declines as the 
divergence between the two sequences increases. 

The term "knockin" as used herein is used interchangeably with the term "fusion 
5 gene" or the act of creating a "fusion gene". This term is also used interchangeable with 
"knockin/fusion." 

The term "peptide" as used herein is defined as a molecule of at least 1 0 amino acids 
in length. 

A compound or composition is said to be "pharmacologically acceptable" if its 
10 administration can be tolerated by a recipient mammal. Such an agent is said to be 
administered in a "pharmacologically effective dose" if the amount administered is 
physiologically significant. An agent is physiologically significant if its presence results in 
technical change in the physiology of a recipient mammal. For example, in the treatment 
of a disease or condition, a compound which inhibits tumor growth or decreases the size of 
1 5 the tumor would be therapeutically effective; a compound which slows the progression of 
the disease, prevents the onset, delays the onset or completely treats the disease, would be 
considered therapeutically effective. 

The term "promoter" as used herein is defined as the region of nucleotide sequence 
which regulates transcription of a specific nucleotide sequence. The term promoter includes 
20 enhancers, silencers and other czs-acting regulatory elements. 

The term "RNA" as used herein is defined as ribonucleic acid. 
The term "reporter sequence" as used herein is defined as the nucleotide sequence 
which when expressed can be detected. The expressed product itself can be detected, such 
as an RNA or protein, or a metabolite or other characteristic secondarily affected by the 
25 reporter product can be detected. The skilled artisan recognizes that any reporter gene that 
could be detected by transcutaneous monitoring, by visualization with UV light, by 
visualization with infrared light, or by visualization with other imaging techniques, such as 
X-ray or MRI, would be of obvious value. Any tissue or body fluid or cell culture or cell 
free extract can be sampled depending on the marker used. For example, coat color, hair 
3 0 color, skin quality, eye color, secreted proteins, histological markers and other markers used 
by those skilled in the art. 
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The term "transcription" as used herein is defined as the synthesis of an RNA 
directed by a DNA template. 

The term "translation" as used herein is defined as the synthesis of a peptide directed 
by an RNA template. 

5 The term "upregulator" as used herein is defined as a compound or molecule which 

increases expression of a particular nucleotide sequence. The increased expression means 
greater expression than the cell or organism is expressing without the upregulator. This 
means there is more expression than is present in the native cell, which can be measured 
before the administration of the upregulator. An upregulator can be any compound or 
10 mutant sequence. 

One specific embodiment of the present invention is a system to determine elements 
that regulate expression of a nucleotide sequence comprising a reporter sequence inserted 
within said nucleotide sequence under conditions wherein the expression of the reporter 

1 5 sequence is controlled by the promoter for said nucleotide sequence. A skilled artisan has 
available the sequences which may be utilized in the present invention as described herein 
or as is readily available at nationally maintained sequence repositories such as GenBank 
(http://www.ncbi.nlm.nih.gov/Entrez/index.html). Similarly, a skilled artisan may obtain 
cell lines and other biological materials through an archive such as the American Type Cell 

20 Culture (ATCC) (http://phage.atcc.org/searchengine/all.html) or through other similar 
repositories. Also, a skilled artisan has available databases such as are available at 
http://www.ond.gov/TechResources/Trans/hmepg.html or http://tbase.jax.org/, in which 
over 800 records of animal lines are maintained, with such detailed information as the 
methods for the generation of each line, a detailed description of the DNA construct used, 

25 expression, phenotype, etc. 

Another specific embodiment of the present invention is a method to identify an 
upregulator or downregulator of expression for a specific nucleotide sequence comprising 
the steps of inserting a test system into a test organism, wherein the test system comprises 
a reporter sequence inserted into a specific nucleotide sequence under conditions wherein 

30 the expression of the reporter sequence is controlled by the promoter for the specific 
nucleotide sequence, delivering a compound, and measuring the effect of the compound on 
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the reporter sequence, wherein when said compound increases the expression of said 
reporter sequence, the compound is an upregulator and when said compound decreases the 
expression of the reporter sequence, the compound is a downregulator. 

Another specific embodiment of the present invention is a method to identify an 
5 upregulator or downregulator of expression of a specific nucleotide sequence comprising the 
steps of inserting a test system into a test organism, wherein the test system comprises a 
reporter sequence inserted into a specific nucleotide sequence under conditions, wherein the 
expression of the reporter sequence is controlled by the promoter, mutagenizing the test 
organism containing the test system and measuring the effect of the mutagenesis on 

1 0 expression of the reporter sequence expression wherein, when the mutant resulting from said 
mutagenizing increases the expression of the reporter sequence, the mutant is an upregulator 
and when the mutant resulting from said mutagenizing decreases the expression of said 
reporter sequence, the mutant is a downregulator. In a further embodiment a test compound 
is delivered to the mutagenized test organism and its effects on regulation of reporter 

15 sequence expression is similarly measured. 

In preferred embodiments of the present invention the reporter sequence is two 
different reporter sequences which are inserted into the specific nucleotide sequence and 
both are controlled by the promoter of the specific nucleotide sequence. One skilled in the 
art will recognize specific advantages of using two different reporters in removing the 

20 effects of randomness, mutations, etc. Also one of the reporters can be a quick screen, like 
hair or eye color and the other could be a chemical screen like CAT, luciferase, etc. 

Specific embodiments also include composition of matter wherein a system as 
described herein is functional and is inserted into the genome of an organism selected from 
the group consisting of non-human mammal, non-mammalian animal, plant, insect, aquatic 

25 organism and an avian species. In other specific embodiments the method described herein 
can use the same above group of organisms as a test organism. 

Another specific embodiment includes a cell, cell line, or cell free extract wherein 
a functional system as described herein is inserted. The cell, cell line, or cell free extract can 
be used as the test organism in any of the methods described herein. In addition to the 

30 organisms described above, human cell lines are also useful. In a specific embodiment 
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tissue culture cells are utilized for drug or genetic screening through methods of the present 
invention. 

In a preferred embodiment a fusion gene is created and introduced into cultured 
mouse ES cells, followed by transmission of the mouse ES alterations to the germline by 
5 methods well known in the art, and cultured human cells. In a specific embodiment a 
reporter sequence is inserted, such as by homologous recombination, into a nucleic acid of 
interest. Such an insertion may or may not disrupt the gene of interest to the extent that a 
functional gene product is not produced. It is preferred that if an insertion disrupts the 
nucleic acid of interest, such a mutation is tolerated in heterozygous form, although it is 

10 possible that in some cases homozygosity regarding the fusion gene in which inactivation 
of the endogenous nucleic acid may alternatively be tolerated. However, as described in the 
Examples herein, the fusion need not inactivate the nucleic acid of interest. 

In an alternative embodiment a fusion gene is created which produces a protein 
fusion between the product of the reporter sequence and the gene product of the nucleic acid 

15 sequence of interest. However, it is preferred that there is an independent start site for 
translation of the reporter sequence. An additional preferred embodiment is another type 
of fusion gene called a "transcriptional" fusion, which places an intact reporter gene 
downstream from the transcriptional start signal of the target gene. The reporter gene fusion 
genes described herein are preferably quantitated to reflect quantitation of the expression of 

20 the nucleic acid of interest. In accordance with an object of the present invention, the 
system wherein a fusion gene is created between a reporter sequence and endogenous 
nucleic acid of interest in mammalian systems is utilized to perform high throughput drug 
screening and/or high throughput genetic screening using cells such as mouse cells in tissue 
culture, human cells in tissue culture, or intact animals such as mice. In a specific 

25 embodiment the drug screening assays for upregulators or downregulators of expression of 
the reporter sequence, and thus the nucleic acid sequence of interest. 

The skilled artisan will readily recognize that the specific nucleotide sequence which 
is being used to look for upregulators or downregulators does not need to derive from the 
nucleotide sequence naturally found in the test organism. For example, a human sequence 

30 of interest, the sequence coding for apolipoprotein A-I, could be inserted into a mouse, 
another species or a cell line or cell free extract and then tested in that species or cell line or 
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cell free extract. Similarly the nucleotide sequence could be from the same species, for 
example mouse growth hormone tested in mouse for upregulation or downregulation. 

Similarly to the specific nucleotide sequence the reporter sequence does not have to 
be from the same organism, although it can be. For example, as seen in the Examples 
5 provided herein, the agouti sequence is used in mice and is a natural mouse sequence. It 
provides an easy method to select mice, i.e. on coat color. On the other hand, luciferase or 
green fluorescent protein could be used in a mouse as reporter sequence even though they 
are not native mouse sequences. Another example would be the use of the human a- 
fetoprotein or human growth hormone as a reporter sequence in a mouse. When two or 

1 0 more reporter sequences are used the reporter sequences themselves could be from the same 
species or different species. In addition all, some or none of the reporter sequences could 
be from the test organism. 

One skilled in the art regularly recognizes that test organisms should be selected 
based on availability in the laboratory as well as the ease of which a marker or reporter 

1 5 sequence can be monitored. In specific embodiments the nucleotide sequence of the present 
invention is selected from the group of organisms consisting of mammals, non-mammalian 
animals, plants, insects, aquatic organisms and avian species. 

Test organisms can be whole animals (mice and rats), tissue or organs (heart, lung, 
kidney), tissue slices, cell lines or cell extracts from various organisms. The cell lines and 

20 cell free extracts can be from an organism of interest, for example humans, or from a 
specific tissue or organ of interest such as brain, liver, heart. One cell line that has been 
useful in the present invention is human stem cells. In many instances cell lines from insects 
and plants can be used in order to look at regulation of expression of the nucleotide 
sequence. In many cases the use of organisms which do not have any commonality with the 

25 reporter sequence makes the reporter sequence easier to detect. 

Another application of the present invention is the use of the system to detect 
upregulators or downregulators to enhance food production and nutritional value in animals 
used for food. An example of this would be to search for upregulators of the growth 
hormone. In this case, once the upregulator is found it could be introduced into the 

30 organism of interest, for example sheep, lamb, cow, pig, fish, or shrimp in order to enhance 
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protein production. Thus, the skilled artisan recognizes there are various applications for 
the food industry and for animals used as food sources. 

Another specific embodiment is the method of treatment of a disease in which there 
is deficiency of expression or excess expression. The skilled artisan recognizes that a 
5 variety of diseases can be treated this way. Some specific examples include: (a) Duchenne 
muscular dystrophy wherein the utrophin nucleotide sequence can be used to screen for 
upregulators and then the upregulators can be delivered to patients in pharmacologically 
effective doses to treat the disease; (b) Alzheimer's whereas the amyloid A beta precursor 
protein (APP) or presinilin-1 nucleotide sequence can be used to screen for downregulators 

10 and the downregulators can then be delivered to patients in pharmacologically effective 
doses; (c) the apolipoprotein A-I and the LDL receptor can be used to test for compounds 
useful in the treatment of high cholesterol by using pharmacologically effective doses of 
upregulators to regulate the production of apolipoprotein A-I or LDL receptor nucleotide 
sequences; (d) tumor suppressor genes can be upregulated to treat and suppress cancer; and 

15 (e) growth hormone disease can be treated using either upregulators or downregulators to 
treat the various conditions associated with excess or deficient expression of growth 
hormone. 

One skilled in the art recognizes that there are a variety of reporter sequences which 
may be utilized. One strength of the invention is the use of reporter sequences that are easily 

20 detected or quantitated. As in the Examples, this can involve easily visible markers, such 
as coat color (agouti gene). A variety of other strategies to affect coat color are also 
possible. These would include the use of secreted proteins such as 
a-melanocyte-stimulating-hormone (a-MSH) which can affect skin, eye and hair 
pigmentation when secreted into the circulation from any part of the body. Other reporter 

25 genes that are useful include secreted proteins, such as human growth hormone (HGH), 
human a- fetoprotein (hAFP), or mouse a- fetoprotein (mAFP), and in the mouse where 
sensitive assays can detect very low levels of compounds in biological fluids or tissues 
including the blood or urine that may have been secreted by a small number of cells in 
distant parts of the organism. In this case, the ease of detection does require sampling of 

30 biological fluids or tissues including blood or urine from each individual animal for 
quantitation of the reporter protein or a metabolite secondarily affected by the reporter 
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protein. Other reporter proteins of interest would obviously include histological markers 
such as chloramphenicol acetyltransferase (CAT), green fluorescent protein (GFP), 
enhanced GFP, blue fluorescent protein, p- galactosidase and p-glucuronidase. Luciferase 
is another widely utilized reporter gene that is being tested in this system. The reporter gene 
5 containing an epitope tag can also be monitored. Examples of epitope tags include HA, myc 
and Flag. 

In general, detection of the reporter means direct measurement or any other effect 
caused secondarily by the reporter, such as (1) change in the level of a metabolite in blood 
or urine, or (2) induction of drug resistance or sensitivity in whole animals or tissue culture 
10 cells. 

The skilled artisan also readily recognizes that the reporter sequence can be inserted 
anywhere within the specific nucleotide sequence as long as it is expressed by the promoter 
controlling the specific nucleotide sequence. In specific embodiments it has been found that 
the reporter sequence can be inserted at either the 5' or 3' portion of the coding sequence, in 

15 an intron or within the 3' untranslated portion of the nucleotide sequence. In specific 
embodiments the reporter sequence is inserted in that part of the sequence which codes for 
the N-terminal or C-terminal portion of the protein. In other embodiments it is important 
that the reporter be inserted in a location so that a knockin/fiision product between the 
knockin (reporter sequence) and the specific nucleotide sequence is formed. This is very 

20 helpful in terms of using this method for detecting posttranscriptional events at either the 
RNA or protein level. One skilled in the art also readily recognizes that when the receptor 
sequence becomes a knockin (i.e. inserted into the specific nucleotide sequence), the specific 
nucleotide sequence itself may not be expressed. As long as this is not lethal to the 
organism being tested, the methods of the present invention will .work to allow for the 

25 termination of upregulators and downregulators of the promoter which regulates the specific 
sequence. One skilled in the art recognizes that if it is lethal then the location for insertion 
needs to be adjusted. Examples of this are shown below. 

It is readily apparent to one skilled in the art that the relationship between a promoter 
and its regulatory elements and the reporter sequence may be addressed by using methods 

30 other than homologous recombination in ES cells or other cells to prepare the 
knockin/fusion sequence. It is obvious that any other method of recombination that would 
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test the activity of the promoter or test for posttranscriptional effects by linking a reporter 
sequence to the promoter, to the transcript, or to the protein would be included within the 
scope of the present invention. Also, the invention may be utilized in vitro with fragments 
of genes or cell extracts. 
5 The skilled artisan recognizes there are a variety of ways to practice the present 

invention. Some examples include introducing reporter sequence into ES cells by 
homologous or other recombination and obtain germline transmission. Then a skilled 
artisan may study animals and/or cultured cells derived from the animals. Alternatively the 
reporter sequence can be introduced into cultured cells either by homologous or non- 
1 0 homologous recombination or as an extrachromosal nucleic acid or similar strategy and the 
cultured cells studied. This can be done for animals, including humans, and especially 
human stem cells. 

In the transcriptional application, the promoter or other regulatory element such as 
an enhancer will be used to control the reporter sequence. This may involve a fusion 

15 transcript or not. Some variations include: (1) the promoter of the specific nucleotide 
sequence of interest is fused to reporter sequence such that ATG codon for start of 
translation is part of the reporter sequence; (2) similar to (1) except that the ATG codon of 
the specific nucleotide sequence of interest is used resulting in a small but functionally 
insignificant N-terminal addition of amino acids to the reporter product; and (3) an enhancer 

20 or silencer from the specific nucleotide sequence of interest is fused to a minimal promoter 
to express the reporter sequence such that the promoter of the specific nucleotide sequence 
of interest is not used while testing other cis elements such as enhancers or silencers. 

One embodiment of this invention is to carry out mutagenesis in mice, other 
mammalian organisms, and other species. The Examples provided herein present ENU 

25 mutagenesis in the mouse, which is well known by skilled artisans to be one of the most 
powerful methods of introducing mutations into the mouse at the present time. However, it 
is also known by the skilled artisan that it is very difficult to identify, recover and 
demonstrate the cause and effect relationship with point mutations introduced by ENU 
mutagenesis. These mutations are readily mapped and are increasingly easily characterized 

30 withprogress on the sequencing of the mouse genome, but other mutagenesis strategies have 
potential advantages. Insertional mutagenesis, sometimes referred to as transposon tagging, 
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has the major advantage of providing for the easy recovery of the mutated gene. Thus, a 
wide variety of mutagens, including chemicals, X-ray, UV, and biological agents are useful. 
One skilled in the art recognizes that any mutagenizing method wherein the mutant sequence 
can be identified is useful in this procedure. 

5 

NUCLEIC ACID-BASED EXPRESSION SYSTEMS 
1. Vectors 

The term "vector" is used to refer to a carrier nucleic acid molecule into which a 
nucleic acid sequence can be inserted for introduction into a cell where it can be replicated. 

10 A nucleic acid sequence can be "exogenous," which means that it is foreign to the cell into 
which the vector is being introduced or that the sequence is homologous to a sequence in the 
cell but in a position within the host cell nucleic acid in which the sequence is ordinarily not 
found. Vectors include plasmids, cosmids, viruses (bacteriophage, animal viruses, and plant 
viruses), and artificial chromosomes (e.g., YACs). One of skill in the art would be well 

15 equipped to construct a vector through standard recombinant techniques, which are 
described in Maniatis et al., 1988 and Ausubel et al., 1994, both incorporated herein by 
reference. 

The term "expression vector" refers to a vector containing a nucleic acid sequence 
coding for at least part of a gene product capable of being transcribed. In some cases, RNA 

20 molecules are then translated into a protein, polypeptide, or peptide. In other cases, these 
sequences are not translated, for example, in the production of antisense molecules or 
ribozymes. Expression vectors can contain a variety of "control sequences," which refer to 
nucleic acid sequences necessary for the transcription and possibly translation of an operably 
linked coding sequence in a particular host organism. In addition to control sequences that 

2 5 govern transcription and translation, vectors and expression vectors may contain nucleic acid 
sequences that serve other functions as well and are described infra. 

a. Promoters and Enhancers 

A "promoter" is a control sequence that is a region of a nucleic acid sequence at 
30 which initiation and rate of transcription are controlled. It may contain genetic elements at 
which regulatory proteins and molecules may bind such as RNA polymerase and other 
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transcription factors. The phrases "operatively positioned," "operatively linked," "under 
control," and "under transcriptional control" mean that a promoter is in a correct functional 
location and/or orientation in relation to a nucleic acid sequence to control transcriptional 
initiation and/or expression of that sequence. A promoter may or may not be used in 
5 conjunction with an "enhancer," which refers to a cis-acting regulatory sequence involved 
in the transcriptional activation of a nucleic acid sequence. 

A promoter may be one naturally associated with a gene or sequence, as may be 
obtained by isolating the 5' non-coding sequences located upstream of the coding segment 
and/or exon. Such a promoter can be referred to as "endogenous." Similarly, an enhancer 

1 0 may be one naturally associated with a nucleic acid sequence, located either downstream or 
upstream of that sequence. Alternatively, certain advantages will be gained by positioning 
the coding nucleic acid segment under the control of a recombinant or heterologous 
promoter, which refers to a promoter that is not normally associated with a nucleic acid 
sequence in its natural environment. A recombinant or heterologous enhancer refers also 

15 to an enhancer not normally associated with a nucleic acid sequence in its natural 
environment. Such promoters or enhancers may include promoters or enhancers of other 
genes, and promoters or enhancers isolated from any other prokaryotic, viral, or eukaryotic 
cell, and promoters or enhancers not "naturally occurring," i.e., containing different elements 
of different transcriptional regulatory regions, and/or mutations that alter expression. In 

20 addition to producing nucleic acid sequences of promoters and enhancers synthetically, 
sequences may be produced using recombinant cloning and/or nucleic acid amplification 
technology, including PCR™, in connection with the compositions disclosed herein (see 
U.S. Patent 4,683,202, U.S. Patent 5,928,906, each incorporated herein by reference). 
Furthermore, it is contemplated the control sequences that direct, transcription and/or 

25 expression of sequences within non-nuclear organelles such as mitochondria, chloroplasts, 
and the like, can be employed as well. 

Naturally, it will be important to employ a promoter and/or enhancer that effectively 
directs the expression of the DNA segment in the cell type, organelle, and organism chosen 
for expression. Those of skill in the art of molecular biology generally know the use of 

30 promoters, enhancers, and cell type combinations for protein expression, for example, see 
Sambrook et al. ( 1989), incorporated herein by reference. The promoters employed may be 
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constitutive, tissue-specific, inducible, and/or useful under the appropriate conditions to 
direct high level expression of the introduced DNA segment, such as is advantageous in the 
large-scale production of recombinant proteins and/or peptides. The promoter may be 
heterologous or endogenous. 
5 The identity of tissue-specific promoters or elements, as well as assays to 

characterize their activity, is well known to those of skill in the art. Examples of such 
regions include the human LIMK2 gene (Nomoto et al. 1999), the somatostatin receptor 2 
gene (Kraus et al., 1998), murine epididymal retinoic acid-binding gene (Lareyre et al., 
1999), human CD4 (Zhao-Emonet et al., 1998), mouse alpha2 (XI) collagen (Tsumaki, et 
10 al., 1998), Dl A dopamine receptor gene (Lee, et al., 1997), insulin-like growth factor II (Wu 
et al., 1997), human platelet endothelial cell adhesion molecule- 1 (Almendro et al., 1996). 

b. Initiation Signals and Internal Ribosome Binding Sites 

15 A specific initiation signal also may be required for efficient translation of coding 

sequences. These signals include the ATG initiation codon or adjacent sequences. 
Exogenous translational control signals, including the ATG initiation codon, may need to 
be provided. One of ordinary skill in the art would readily be capable of determining this 
and providing the necessary signals. It is well known that the initiation codon must be 

20 "in-frame" with the reading frame of the desired coding sequence to ensure translation of the 
entire insert. The exogenous translational control signals and initiation codons can be either 
natural or synthetic. The efficiency of expression may be enhanced by the inclusion of 
appropriate transcription enhancer elements. 

In certain embodiments of the invention, the use of internal ribosome entry sites 

25 (IRES) elements are used to create multigene, or polycistronic, messages. IRES elements 
are able to bypass the ribosome scanning model of 5' methylated Cap dependent translation 
and begin translation at internal sites (Pelletier and Sonenberg, 1988). IRES elements from 
two members of the picornavirus family (polio and encephalomyocarditis) have been 
described (Pelletier and Sonenberg, 1988), as well an IRES from a mammalian message 

30 (Macejak and Sarnow, 1991). IRES elements can be linked to heterologous open reading 
frames. Multiple open reading frames can be transcribed together, each separated by an 
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IRES, creating polycistronic messages. By virtue of the IRES element, each open reading 
frame is accessible to ribosomes for efficient translation. Multiple genes can be efficiently 
expressed using a single promoter/enhancer to transcribe a single message (see U.S. Patent 
5,925,565 and 5,935,819, herein incorporated by reference). 

5 

c. Multiple Cloning Sites 

Vectors can include a multiple cloning site (MCS), which is a nucleic acid region 
that contains multiple restriction enzyme sites, any of which can be used in conjunction with 
standard recombinant technology to digest the vector. (See Carbonelli et al., 1999, 

10 Levenson et al., 1998, and Cocea, 1997, incorporated herein by reference.) "Restriction 
enzyme digestion" refers to catalytic cleavage of a nucleic acid molecule with an enzyme 
that functions only at specific locations in a nucleic acid molecule. Many of these restriction 
enzymes are commercially available. Use of such enzymes is widely understood by those 
of skill in the art. Frequently, a vector is linearized or fragmented using a restriction enzyme 

15 that cuts within the MCS to enable exogenous sequences to be ligated to the vector. 
"Ligation" refers to the process of forming phosphodiester bonds between two nucleic acid 
fragments, which may or may not be contiguous with each other. Techniques involving 
restriction enzymes and ligation reactions are well known to those of skill in the art of 
recombinant technology. 

20 

d. Splicing Sites 

Most transcribed eukaryotic RNA molecules will undergo RNA splicing to remove 
introns from the primary transcripts. A skilled artisan is aware that genomic eukaryotic 
sequences may require donor and/or acceptor splicing sites to ensure proper processing of 
25 the transcript for protein expression. (See Chandler et al., 1997, herein incorporated by 
reference.) 

e. Polyadenylation Signals 

In expression, one will typically include a polyadenylation signal to effect proper 
30 polyadenylation of the transcript. The nature of the polyadenylation signal is not believed 
to be crucial to the successful practice of the invention, and/or any such sequence may be 
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employed. Preferred embodiments include the SV40 polyadenylation signal and/or the 
bovine growth hormone polyadenylation signal, convenient and/or known to function well 
in various target cells. Also contemplated as an element of the expression cassette is a 
transcriptional termination site. These elements can serve to enhance message levels and/or 
5 to minimize read through from the cassette into other sequences. 

f. Origins of Replication 

In order to propagate a vector in a host cell, it may contain one or more origins of 
replication sites (often termed "ori"), which is a specific nucleic acid sequence at which 
10 replication is initiated. Alternatively an autonomously replicating sequence (ARS) can be 
employed if the host cell is yeast. 

g. Selectable and Screenable Markers 

In certain embodiments of the invention, the cells contain nucleic acid construct of 

1 5 the present invention, a cell may be identified in vitro or in vivo by including a marker in the 
expression vector. Such markers would confer an identifiable change to the cell permitting 
easy identification of cells containing the expression vector. Generally, a selectable marker 
is one that confers a property that allows for selection. A positive selectable marker is one 
in which the presence of the marker allows for its selection, while a negative selectable 

20 marker is one in which its presence prevents its selection. An example of a positive 
selectable marker is a drug resistance marker. 

Usually the inclusion of a drug selection marker aids in the cloning and identification 
of transformants, for example, genes that confer resistance to neomycin, puromycin, 
hygromycin, DHFR, GPT, zeocin and histidinol are useful selectable markers. In addition 

25 to markers conferring a phenotype that allows for the discrimination of transformants based 
on the implementation of conditions, other types of markers including screenable markers 
such as GFP, whose basis is colorimetric analysis, are also contemplated. Alternatively, 
screenable enzymes such as herpes simplex virus thymidine kinase (tk) or chloramphenicol 
acetyltransferase (CAT) may be utilized. One of skill in the art would also know how to 

30 employ immunologic markers, possibly in conjunction with FACS analysis. The marker 
used is not believed to be important, so long as it is capable of being expressed 
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simultaneously with the nucleic acid encoding a gene product. Further examples of 
selectable and screenable markers are well known to one of skill in the art. 

2. Host Cells 

5 As used herein, the terms "cell," "cell line," and "cell culture" may be used 

interchangeably. All of these term also include their progeny, which is any and all 
subsequent generations. It is understood that all progeny may not be identical due to 
deliberate or inadvertent mutations. In the context of expressing a heterologous nucleic acid 
sequence, "host cell" refers to a prokaryotic or eukaryotic cell, and it includes any 

10 transformable organisms that is capable of replicating a vector and/or expressing a 
heterologous gene encoded by a vector. A host cell can, and has been, used as a recipient 
for vectors. A host cell may be "transfected" or "transformed," which refers to a process by 
which exogenous nucleic acid is transferred or introduced into the host cell. A transformed 
cell includes the primary subject cell and its progeny. 

1 5 Host cells may be derived from prokaryotes or eukaryotes, depending upon whether 

the desired result is replication of the vector or expression of part or all of the 
vector-encoded nucleic acid sequences. Numerous cell lines and cultures are available for 
use as a host cell, and they can be obtained through the American Type Culture Collection 
(ATCC), which is an organization that serves as an archive for living cultures and genetic 

20 materials (www.atcc.org). An appropriate host can be determined by one of skill in the art 
based on the vector backbone and the desired result. A plasmid or cosmid, for example, can 
be introduced into a prokaryote host cell for replication of many vectors. Bacterial cells 
used as host cells for vector replication and/or expression include DH5a, JM109, and KC8, 
as well as a number of commercially available bacterial hosts such as SURE® Competent 

25 Cells and SOLOPACKa Gold Cells (STRATAGENE®, La Jolla). Alternatively, bacterial 
cells such as E. coli LE392 could be used as host cells for phage viruses. 

Examples of eukaryotic host cells for replication and/or expression of a vector 
include HeLa, NIH3T3, Jurkat, 293, Cos, CHO, Saos, and PC12. Many host cells from 
various cell types and organisms are available and would be known to one of skill in the art. 

30 Similarly, a viral vector may be used in conjunction with either a eukaryotic or prokaryotic 
host cell, particularly one that is permissive for replication or expression of the vector. 
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Some vectors may employ control sequences that allow it to be replicated and/or 
expressed in both prokaryotic and eukaryotic cells. One of skill in the art would further 
understand the conditions under which to incubate all of the above described host cells to 
maintain them and to permit replication of a vector. Also understood and known are 
5 techniques and conditions that would allow large-scale production of vectors, as well as 
production of the nucleic acids encoded by vectors and their cognate polypeptides, proteins, 
or peptides. 

3. Expression Systems 

10 Numerous expression systems exist that comprise at least a part or all of the 

compositions discussed above. Prokaryote- and/or eukaryote-based systems can be 
employed for use with the present invention to produce nucleic acid sequences, or their 
cognate polypeptides, proteins and peptides. Many such systems are commercially and 
widely available. 

1 5 The insect cell/baculovirus system can produce a high level of protein expression of 

a heterologous nucleic acid segment, such as described in U.S. Patent No. 5,871,986, 
4,879,236, both herein incorporated by reference, and which can be bought, for example, 
under the name MAXBAC® 2.0 from INVITROGEN® and BACPACK™ 
BACULO VIRUS EXPRESSION SYSTEM FROM CLONTECH®. 

20 Other examples of expression systems include STRATAGENE®'s COMPLETE 

CONTROLa Inducible Mammalian Expression System, which involves a synthetic 
ecdysone-inducible receptor, or its pET Expression System, an E. coli expression system. 
Another example of an inducible expression system is available from INVITROGEN®, 
which carries the T-REX™ (tetracycline-regulated expression) System, an inducible 

25 mammalian expression system that uses the full-length CMV promoter. INVITROGEN® 
also provides a yeast expression system called the Pichia methanolica Expression System, 
which is designed for high-level production of recombinant proteins in the methylotrophic 
yeast Pichia methanolica. One of skill in the art would know how to express a vector, such 
as an expression construct, to produce a nucleic acid sequence or its cognate polypeptide, 

30 protein, or peptide. 
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Derivatives of Promoter Sequences 

One aspect of the invention provides derivatives of specific promoters. One means 
for preparing derivatives of such promoters comprises introducing mutations into the 
promoter sequences. Such mutants may potentially have enhanced, reduced, or altered 
5 function relative to the native sequence, or alternatively, may be silent with regard to 
function. 

Mutagenesis may be carried out at random and the mutagenized sequences screened 
for function. Alternatively, particular sequences which provide the promoter region with 
desirable expression characteristics could be identified and these or similar sequences 

1 0 introduced into other related or non-related sequences via mutation. Similarly, non-essential 
elements may be deleted without significantly altering the function of the promoter. It is 
further contemplated that one could mutagenize these sequences in order to enhance their 
utility in expressing transgenes, especially in a gene therapy construct in humans. 

The means for mutagenizing a DNA segment comprising a specific promoter 

15 sequence are well-known to those of skill in the art. Mutagenesis may be performed in 
accordance with any of the techniques known in the art, such as, and not limited to, 
synthesizing an oligonucleotide having one or more mutations within the sequence of a 
particular regulatory region. In particular, site-specific mutagenesis is a technique useful 
in the preparation of promoter mutants, through specific mutagenesis of the underlying 

20 DNA. The technique further provides a ready ability to prepare and test sequence variants, 
by introducing one or more nucleotide sequence changes into the DNA. 

Site-specific mutagenesis allows the production of mutants through the use of 
specific oligonucleotide sequences which encode the DNA sequence of the desired mutation, 
as well as a sufficient number of adjacent nucleotides, to provide a primer sequence of 

25 sufficient size and sequence complexity to form a stable duplex on both sides of the deletion 
junction being traversed. Typically, a primer of about 13 to about 75 nucleotides or more 
in length is preferred, with about 10 to about 25 or more residues on both sides of the 
junction of the sequence being altered. 

In general, the technique of site-specific mutagenesis is well known in the art, as 

30 exemplified by various publications. As will be appreciated, the technique typically 
employs a phage vector which exists in both a single stranded and double stranded form. 
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Typical vectors useful in site-directed mutagenesis include vectors such as the Ml 3 phage. 
These phage are readily commercially available and their use is generally well known to 
those skilled in the art. Double stranded plasmids also are routinely employed in site 
directed mutagenesis to eliminate the step of transferring the gene of interest from aplasmid 
5 to a phage. 

Alternatively, the use of PCR™ with commercially available thermostable enzymes 
such as Taq polymerase may be used to incorporate a mutagenic oligonucleotide primer into 
an amplified DNA fragment that can then be cloned into an appropriate cloning or 
expression vector. The PCR™-mediated mutagenesis procedures of Tomic et al ( 1 990) and 

10 Upender et al (1995) provide two examples of such protocols. 

The preparation of sequence variants of the selected promoter or intron-encoding 
DNA segments using site-directed mutagenesis is provided as a means of producing 
potentially useful species and is not meant to be limiting as there are other ways in which 
sequence variants of DNA sequences may be obtained. For example, recombinant vectors 

15 encoding the desired promoter sequence may be treated with mutagenic agents, such as 
hydroxylamine, to obtain sequence variants. 

Typically, vector mediated methodologies involve the introduction of the nucleic 
acid fragment into a DNA or RNA vector, the clonal amplification of the vector, and the 
recovery of the amplified nucleic acid fragment. Examples of such methodologies are 

20 provided by U.S. Patent No. 4,237,224, incorporated herein by reference. A number of 
template dependent processes are available to amplify the target sequences of interest 
present in a sample, such methods being well known in the art and specifically disclosed 
herein. 

One efficient, targeted means for preparing mutagenized promoters or enhancers 
25 relies upon the identification of putative regulatory elements within the target sequence. 

These can be identified, for example, by comparison with known promoter sequences. 

Sequences which are shared among genes with similar functions or expression patterns are 

likely candidates for the binding of transcription factors and are likely elements to confer 

tissue specific expression patterns. 
30 One of skill in the art will recognize that regulatory elements may be included in 

regions of the gene other than the 5'-untranslated region, and comparison of coding and 
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3'-noncoding regions of genes may identify putative regulatory elements. Confirmation of 
putative regulatory elements can be achieved by deletion, duplication, or other alteration or 
mutation of each putative regulatory region followed by functional analysis of each 
construct by assay of a reporter gene which is functionally attached to each construct. As 
5 such, once a starting promoter sequence is provided, any of a number of different functional 
mutants of the starting sequence could be readily prepared using methods well known in the 
art (Zhang et al, 1997). 

Mutation, alteration, duplication, or truncation mutants the promoter region of the 
invention could be randomly prepared or prepared by selection of regions identified as 

10 containing putative regulatory elements, and then assayed. With this strategy, a series of 
constructs are prepared, each containing a different portion of the clone (a subclone), either 
mutated or altered or containing wild-type sequence, and these constructs are then screened 
for activity. A suitable means for screening for activity is to attach such a promoter 
construct to a selectable or screenable marker gene and to assay for gene expression. 

15 Other assays may be used to identify responsive elements in a promoter region or 

gene. Such assays will be known to those of skill in the art (see for example, Sambrook et 
al. 9 1989; Zhang et ah 1997; Shan et aL, 1997; Dai and Burnstein, 1996; Cleutjens et al, 
1997; Ng et aL, 1994; Shida et al, 1993), and include DNase I footprinting studies, 
Elecromobility Shift Assay patterns (EMS A), the binding pattern of purified transcription 

20 factors, effects of specific transcription factor antibodies in inhibiting the binding of a 
transcription factor to a putative responsive element, Western analysis, nuclear run-on 
assays, and DNA methylation interference analysis. 

Preferred promoter constructs may be identified that retain the desired, or even 
enhanced, activity (e.g., tissue-specific expression). The smallest segment required for 

25 activity may be identified through comparison of the selected deletion or mutation 
constructs. Once identified, such segments may be duplicated, mutated, or combined with 
other known or regulatory elements and assayed for activity or regulatory properties. 
Promoter region sequences used to identify regulatory elements can also be used to identify 
and isolate transcription factors that bind a putative regulatory sequence or element, 

30 according to standard methods of protein purification, such as affinity chromatography, as 
discussed above. 
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Preferably, identified promoter region sequences, whether used alone or combined 
with additional promoters, enhancers, or regulatory elements, will be induced and/or 
regulated by an external agent, such as a hormone, transcription factor, enzyme, or 
pharmaceutical agent, to express operatively linked genes or sequences (Zhang et al. 9 1997; 
5 Shan et ah, 1 997). Alternatively, such a construct may be designed to cease expression upon 
exposure to an external agent. 

Additionally, deletion mutants may be produced and assayed essentially according 
to Matusik (U.S. Patent No. 5,783,681, July 21, 1998). Plasmids may be constructed 
containing the promoter adjacent to a reporter gene, for example CAT. The construct may 

10 be designed to contain additional regulatory sequences, such as polyadenylation, termination 
and cleavage signals. Deletion mutants may be prepared by a time course treatment of the 
isolated prostate specific transglutaminase promoter with Bal 31 exonuclease (for 15, 30, 
45, 60 and 75 seconds, for example). Following limited digestion, the promoter sequence 
may be ligated to appropriate linker sequences and reinserted into the CAT expression 

1 5 vector. After transformation into an appropriate host cell, such as E. coli, clones containing 
deletion mutants may be screened and their plasmid DNAs digested with restriction enzymes 
that cut at sites flanking the promoter sequence. Promoter size may be determined by 
agarose gel electrophoresis according to standard techniques. 

Following selection of a range of deletion mutants of varying size, the activities of 

20 the deleted promoters for expression of the linked CAT gene may be determined according 
to standard protocols. 

The precise nature of the deleted portion of the promoter may be determined using 
standard DNA sequencing, such as Sanger dideoxy termination sequencing, to identify 
which promoter sequences have been removed in each of the assayed deletion mutants. 

25 Thus, a correlation may be obtained between the presence or absence of specific elements 
within the promoter sequence and changes in activity of the linked reporter gene. 

Assays of Gene Expression 

Assays may be employed within the scope of the instant invention for determination 
30 of the relative efficiency of gene expression. For example, assays may be used to determine 
the efficacy of deletion mutants of specific promoter regions in directing expression of 
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operably linked genes. Similarly, one could produce random or site-specific mutants of 
promoter regions and assay the efficacy of the mutants in the expression of an operably 
linked gene. Alternatively, assays could be used to determine the function of a promoter 
region in enhancing gene expression when used in conjunction with various different 
5 regulatory elements, enhancers, and exogenous genes. 

Gene expression may be determined by measuring the production of RNA, protein 
or both. The gene product (RNA or protein) may be isolated and/or detected by methods 
well known in the art. Following detection, one may compare the results seen in a given cell 
line or individual with a statistically significant reference group of non-transformed control 

1 0 cells. Alternatively, one may compare production of RNA or protein products in cell lines 
transformed with the same gene operably linked to various mutants of a promoter sequence. 
In this way, it is possible to identify regulatory regions within a novel promoter sequence 
by their effect on the expression of an operably linked gene. 

In certain embodiments, it will be desirable to use genes whose expression is 

15 naturally linked to a given promoter or other regulatory element. For example, a prostate 
specific promoter may be operably linked to a gene that is normally expressed in prostate 
tissues. Alternatively, marker genes may be used for assaying promoter activity. Using, for 
example, a selectable marker gene, one could quantitatively determine the resistance 
conferred upon a tissue culture cell line or animal cell by a construct comprising the 

20 selectable marker gene operably linked to the promoter to be assayed. Alternatively, various 
tissue culture cell line or animal parts could be exposed to a selective agent and the relative 
resistance provided in these parts quantified, thereby providing an estimate of the tissue 
specific expression of the promoter. 

Screenable markers constitute another efficient means for quantifying the expression 

25 of a given gene. Potentially any screenable marker could be expressed and the marker gene 
product quantified, thereby providing an estimate of the efficiency with which the promoter 
directs expression of the gene. Quantification can readily be carried out using either visual 
means, or, for example, a photon counting device. 

A preferred screenable marker gene for use with the current invention is 

30 p -glucuronidase (GUS). Detection of GUS activity can be performed histochemically using 
5-bromo-4-chloro-3-indolyl glucuronide (X-gluc) as the substrate for the GUS enzyme, 
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yielding a blue precipitate inside of cells containing GUS activity. This assay has been 
described in detail (Jefferson, 1987). The blue coloration can then be visually scored, and 
estimates of expression efficiency thereby provided. GUS activity also can be determined 
by immunoblot analysis or a fluorimetric GUS specific activity assay (Jefferson, 1987). 

5 

Nucleic Acid Detection 

In accordance with the object of the present invention, there is an embodiment 
wherein the expression of a reporter sequence, an endogenous nucleic acid sequence of 
interest, or a fusion between all of parts of the two are detected. 

10 

1. Hybridization 

The use of a probe or primer of between 1 3 and 1 00 nucleotides, preferably between 
17 and 100 nucleotides in length, or in some aspects of the invention up to 1-2 kilobases or 
more in length, allows the formation of a duplex molecule that is both stable and selective. 

1 5 Molecules having complementary sequences over contiguous stretches greater than 20 bases 
in length are generally preferred, to increase stability and/or selectivity of the hybrid 
molecules obtained. One will generally prefer to design nucleic acid molecules for 
hybridization having one or more complementary sequences of 20 to 30 nucleotides, or even 
longer where desired. Such fragments may be readily prepared, for example, by directly 

20 synthesizing the fragment by chemical means or by introducing selected sequences into 
recombinant vectors for recombinant production. 

Accordingly, the nucleotide sequences of the invention may be used for their ability 
to selectively form duplex molecules with complementary stretches of DNAs and/or RNAs 
or to provide primers for amplification of DNA or RNA from samples. Depending on the 

25 application envisioned, one would desire to employ varying conditions of hybridization to 
achieve varying degrees of selectivity of the probe or primers for the target sequence. 

For applications requiring high selectivity, one will typically desire to employ 
relatively high stringency conditions to form the hybrids. For example, relatively low salt 
and/or high temperature conditions, such as provided by about 0.02 M to about 0. 1 0 M NaCl 

30 at temperatures of about 50°C to about 70°C. Such high stringency conditions tolerate little, 
if any, mismatch between the probe or primers and the template or target strand and would 
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be particularly suitable for isolating specific genes or for detecting specific mRNA 
transcripts. It is generally appreciated that conditions can be rendered more stringent by the 
addition of increasing amounts of formamide. 

For certain applications, for example, site-directed mutagenesis, it is appreciated that 
5 lower stringency conditions are preferred. Under these conditions, hybridization may occur 
even though the sequences of the hybridizing strands are not perfectly complementary, but 
are mismatched at one or more positions. Conditions may be rendered less stringent by 
increasing salt concentration and/or decreasing temperature. For example, a medium 
stringency condition could be provided by about 0.1 to 0.25 M NaCl at temperatures of 

1 0 about 37°C to about 55°C, while a low stringency condition could be provided by about 0. 1 5 
M to about 0.9 M salt, at temperatures ranging from about 20°C to about 55°C. 
Hybridization conditions can be readily manipulated depending on the desired results. 

In other embodiments, hybridization may be achieved under conditions of, for 
example, 50 mM Tris-HCl (pH 8.3), 75 mM KC1, 3 mM MgC12, 1.0 mM dithiothreitol, at 

15 temperatures between approximately 20°C to about 37°C. Other hybridization conditions 
utilized could include approximately 10 mM Tris-HCl (pH 8.3), 50 mM KC1, 1.5 mM 
MgC12, at temperatures ranging from approximately 40°C to about 72°C. 

In certain embodiments, it will be advantageous to employ nucleic acids of defined 
sequences of the present invention in combination with an appropriate means, such as a 

20 label, for determining hybridization. A wide variety of appropriate indicator means are 
known in the art, including fluorescent, radioactive, enzymatic or other ligands, such as 
avidin/biotin, which are capable of being detected. In preferred embodiments, one may 
desire to employ a fluorescent label or an enzyme tag such as urease, alkaline phosphatase 
or peroxidase, instead of radioactive or other environmentally undesirable reagents. In the 

25 case of enzyme tags, colorimetric indicator substrates are known that can be employed to 
provide a detection means that is visibly or spectrophotometrically detectable, to identify 
specific hybridization with complementary nucleic acid containing samples. 

In general, it is envisioned that the probes or primers described herein will be useful 
as reagents in solution hybridization, as in PCR™, for detection of expression of 

30 corresponding genes, as well as in embodiments employing a solid phase. In embodiments 
involving a solid phase, the test DNA (or RNA) is adsorbed or otherwise affixed to a 
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selected matrix or surface. This fixed, single-stranded nucleic acid is then subjected to 
hybridization with selected probes under desired conditions. The conditions selected will 
depend on the particular circumstances (depending, for example, on the G+C content, type 
of target nucleic acid, source of nucleic acid, size of hybridization probe, etc.). Optimization 
5 of hybridization conditions for the particular application of interest is well known to those 
of skill in the art. After washing of the hybridized molecules to remove non-specifically 
bound probe molecules, hybridization is detected, and/or quantified, by determining the 
amount of bound label. Representative solid phase hybridization methods are disclosed in 
U.S. Patent Nos. 5,843,663, 5,900,481 and 5,919,626. Othermethods of hybridization that 
10 may be used in the practice of the present invention are disclosed in U.S. Patent Nos. 
5,849,481, 5,849,486 and 5,851,772. The relevant portions of these and other references 
identified in this section of the Specification are incorporated herein by reference. 

2. Amplification of Nucleic Acids 

1 5 Nucleic acids used as a template for amplification may be isolated from cells, tissues 

or other samples according to standard methodologies (Sambrook et aL, 1989). In certain 
embodiments, analysis is performed on whole cell or tissue homogenates or biological fluid 
samples without substantial purification of the template nucleic acid. The nucleic acid may 
be genomic DNA or fractionated or whole cell RNA. Where RNA is used, it may be desired 

20 to first convert the RNA to a complementary DNA. 

The term "primer, 11 as used herein, is meant to encompass any nucleic acid that is 
capable of priming the synthesis of a nascent nucleic acid in a template-dependent process. 
Typically, primers are oligonucleotides from ten to twenty and/or thirty base pairs in length, 
but longer sequences can be employed. Primers may be provided in double-stranded and/or 

25 single-stranded form, although the single-stranded form is preferred. 

Pairs of primers designed to selectively hybridize to a nucleic acid of interest are 
contacted with the template nucleic acid under conditions that permit selective hybridization. 
Depending upon the desired application, high stringency hybridization conditions may be 
selected that will only allow hybridization to sequences that are completely complementary 

30 to the primers. In other embodiments, hybridization may occur under reduced stringency 
to allow for amplification of nucleic acids contain one or more mismatches with the primer 
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sequences. Once hybridized, the template-primer complex is contacted with one or more 
enzymes that facilitate template-dependent nucleic acid synthesis. Multiple rounds of 
amplification, also referred to as "cycles," are conducted until a sufficient amount of 
amplification product is produced. 
5 The amplification product may be detected or quantified. In certain applications, the 

detection may be performed by visual means. Alternatively, the detection may involve 
indirect identification of the product via chemiluminescence, radioactive scintigraphy of 
incorporated radiolabel or fluorescent label or even via a system using electrical and/or 
thermal impulse signals (Affymax technology; Bellus, 1994). 

10 A number of template dependent processes are available to amplify the 

oligonucleotide sequences present in a given template sample. One of the best known 
amplification methods is the polymerase chain reaction (referred to as PCR™) which is 
described in detail in U.S. Patent Nos. 4,683,195, 4,683,202 and 4,800,159, and in Innis et 
al., 1990, each of which is incorporated herein by reference in their entirety. 

15 A reverse transcriptase PCR™ amplification procedure may be performed to quantify 

the amount ofmRNA amplified. Methods of reverse transcribing RNA into cDNA are well 
known and described in Sambrook et al., 1989. Alternative methods for reverse 
transcription utilize thermostable DNA polymerases. These methods are described in WO 
90/07641. Polymerase chain reaction methodologies are well known in the art. 

20 Representative methods of RT-PCR are described in U.S. Patent No. 5,882,864. 

Another method for amplification is ligase chain reaction ("LCR"), disclosed in 
European Application No. 320 308, incorporated herein by reference in its entirety. U.S. 
Patent 4,883,750 describes a method similar to LCR for binding probe pairs to a target 
sequence. A method based on PCR™ and oligonucleotide ligase assay (OLA), disclosed in 

25 U.S. Patent 5,912,148, may also be used. 

Alternative methods for amplification of target nucleic acid sequences that may be 
used in the practice of the present invention are disclosed in U.S. Patent Nos. 5,843,650, 
5,846,709, 5,846,783, 5,849,546, 5,849,497, 5,849,547, 5,858,652, 5,866,366, 5,916,776, 
5,922,574, 5,928,905, 5,928,906, 5,932,451, 5,935,825, 5,939,291 and 5,942,391, GB 

30 Application No. 2 202 328, and in PCT Application No. PCT/US89/01025, each of which 
is incorporated herein by reference in its entirety. 
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Qbeta Replicase, described in PCT Application No. PCT/US87/00880, may also be 
used as an amplification method in the present invention. In this method, a replicative 
sequence of RNA that has a region complementary to that of a target is added to a sample 
in the presence of an RNA polymerase. The polymerase will copy the replicative sequence 
5 which may then be detected. 

An isothermal amplification method, in which restriction endonucleases and ligases 
are used to achieve the amplification of target molecules that contain nucleotide 
5'-[alpha-thio]-triphosphates in one strand of a restriction site may also be useful in the 
amplification of nucleic acids in the present invention (Walker et aL, 1992). Strand 

10 Displacement Amplification (SDA), disclosed in U.S. Patent No. 5,916,779, is another 
method of carrying out isothermal amplification of nucleic acids which involves multiple 
rounds of strand displacement and synthesis, i.e., nick translation. 

Other nucleic acid amplification procedures include transcription-based amplification 
systems (TAS), including nucleic acid sequence based amplification (NASBA) and 3SR 

1 5 (Kwoh et aL, 1 989; Gingeras et aL, PCT Application WO 88/1 03 1 5, incorporated herein by 
reference in their entirety). Davey et aL, European Application No. 329 822 disclose a 
nucleic acid amplification process involving cyclically synthesizing single-stranded RNA 
("ssRNA"), ssDNA, and double-stranded DNA (dsDNA), which may be used in accordance 
with the present invention. 

20 Miller et aL, PCT Application WO 89/06700 (incorporated herein by reference in 

its entirety) disclose a nucleic acid sequence amplification scheme based on the 
hybridization of a promoter region/primer sequence to a target single-stranded DNA 
("ssDNA") followed by transcription of many RNA copies of the sequence. This scheme 
is not cyclic, i.e., new templates are not produced from the resultant RNA transcripts. Other 

25 amplification methods include "race" and "one-sided PCR" (Frohman, 1990; Ohara et aL, 
1989). 

3. Detection of Nucleic Acids 

Following any amplification, it may be desirable to separate the amplification 
30 product from the template and/or the excess primer. In one embodiment, amplification 
products are separated by agarose, agarose-acrylamide orpolyacrylamide gel electrophoresis 
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using standard methods (Sambrook et al. 9 1989). Separated amplification products may be 
cut out and eluted from the gel for further manipulation. Using low melting point agarose 
gels, the separated band may be removed by heating the gel, followed by extraction of the 
nucleic acid. 

5 Separation of nucleic acids may also be effected by chromatographic techniques 

known in art. There are many kinds of chromatography which may be used in the practice 
of the present invention, including adsorption, partition, ion-exchange, hydroxylapatite, 
molecular sieve, reverse-phase, column, paper, thin-layer, and gas chromatography as well 
as HPLC. 

10 In certain embodiments, the amplification products are visualized. A typical 

visualization method involves staining of a gel with ethidium bromide and visualization of 
bands under UV light. Alternatively, if the amplification products are integrally labeled 
with radio- or fluorometrically-labeled nucleotides, the separated amplification products can 
be exposed to x-ray film or visualized under the appropriate excitatory spectra. 

15 In one embodiment, following separation of amplification products, a labeled nucleic 

acid probe is brought into contact with the amplified marker sequence. The probe preferably 
is conjugated to a chromophore but may be radiolabeled. In another embodiment, the probe 
is conjugated to a binding partner, such as an antibody or biotin, or another binding partner 
carrying a detectable moiety. 

20 In particular embodiments, detection is by Southern blotting and hybridization with 

a labeled probe. The techniques involved in Southern blotting are well known to those of 
skill in the art. See Sambrook et aL, 1989. One example of the foregoing is described in 
U.S. Patent No. 5,279,721, incorporated by reference herein, which discloses an apparatus 
and method for the automated electrophoresis and transfer of nucleic .acids. The apparatus 

25 permits electrophoresis and blotting without external manipulation of the gel and is ideally 
suited to carrying out methods according to the present invention. 

Other methods of nucleic acid detection that may be used in the practice of the instant 
invention are disclosed in U.S. Patent Nos. 5,840,873, 5,843,640, 5,843,651, 5,846,708, 
5,846,717, 5,846,726, 5,846,729, 5,849,487, 5,853,990, 5,853,992, 5,853,993, 5,856,092, 
30 5,861,244, 5,863,732, 5,863,753, 5,866,331, 5,905,024, 5,910,407, 5,912,124, 5,912,145, 
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5,919,630, 5,925,517, 5,928,862, 5,928,869, 5,929,227, 5,932,413 and 5,935,791, each of 
which is incorporated herein by reference. 



4. Other Assays 

5 Other methods for genetic screening may be used within the scope of the present 

invention, for example, to detect mutations in genomic DNA, cDNA and/or RNA samples. 
Methods used to detect point mutations include denaturing gradient gel electrophoresis 
("DGGE"), restriction fragment length polymorphism analysis ("RFLP"), chemical or 
enzymatic cleavage methods, direct sequencing of target regions amplified by PCRTM (see 

10 above), single-strand conformation polymorphism analysis ("SSCP") and other methods 
well known in the art. 

One method of screening for point mutations is based on RNase cleavage of base 
pair mismatches in RNA/DNA or RNA/RNA heteroduplexes. As used herein, the term 
"mismatch" is defined as a region of one or more unpaired or mispaired nucleotides in a 

15 double-stranded RNA/RNA, RNA/DNA or DNA/DNA molecule. This definition thus 
includes mismatches due to insertion/deletion mutations, as well as single or multiple base 
point mutations. 

U.S. Patent No. 4,946,773 describes an RNase A mismatch cleavage assay that 
involves annealing single-stranded DNA or RNA test samples to an RNA probe, and 
20 subsequent treatment of the nucleic acid duplexes with RNase A. For the detection of 
mismatches, the single-stranded products of the RNase A treatment, electrophoretically 
separated according to size, are compared to similarly treated control duplexes. Samples 
containing smaller fragments (cleavage products) not seen in the control duplex are scored 
as positive. 

25 Other investigators have described the use of RNase I in mismatch assays. The use 

of RNase I for mismatch detection is described in literature from Promega Biotech. 
Promega markets a kit containing RNase I that is reported to cleave three out of four known 
mismatches. Others have described using the MutS protein or other DNA-repair enzymes 
for detection of single-base mismatches. 

30 Alternative methods for detection of deletion, insertion or substitution mutations 

that may be used in the practice of the present invention are disclosed in U.S. Patent Nos. 
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5,849,483, 5,851,770, 5,866,337, 5,925,525 and 5,928,870, each of which is incorporated 
herein by reference in its entirety. 

5. Kits 

5 All the essential materials and/or reagents required for detecting a nucleic acid 

sequence of interest in a sample may be assembled together in a kit. This generally will 
comprise a probe or primers designed to hybridize specifically to individual nucleic acids 
of interest or reporter sequence in the practice of the present invention. Also included may 
be enzymes suitable for amplifying nucleic acids, including various polymerases (reverse 
10 transcriptase, Taq, etc.), deoxynucleotides and buffers to provide the necessary reaction 
mixture for amplification. Such kits may also include enzymes and other reagents suitable 
for detection of specific nucleic acids or amplification products. Such kits generally will 
comprise, in suitable means, distinct containers for each individual reagent or enzyme as 
well as for each probe or primer pair. 

15 

Mutagenesis 

Where employed, mutagenesis will be accomplished by a variety of standard, 
mutagenic procedures. Mutation is the process whereby changes occur in the quantity or 
structure of an organism. Mutation can involve modification of the nucleotide sequence of 

20 a single gene, blocks of genes or whole chromosome. Changes in single genes may be the 
consequence of point mutations which involve the removal, addition or substitution of a 
single nucleotide base within a DNA sequence, or they may be the consequence of changes 
involving the insertion or deletion of large numbers of nucleotides. 

Mutations can arise spontaneously as a result of events such as errors in the fidelity 

25 of DNA replication or the movement of transposable genetic elements (transposons) within 
the genome. They also are induced following exposure to chemical or physical mutagens. 
Such mutation-inducing agents include ionizing radiations, ultraviolet light and a diverse 
array of chemical such as alkylating agents and polycyclic aromatic hydrocarbons, all of 
which are capable of interacting either directly or indirectly (generally following some 

30 metabolic biotransformations) with nucleic acids. The DNA lesions induced by such 
environmental agents may lead to modifications of base sequence when the affected DNA 
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is replicated or repaired and thus to a mutation. Mutation also can be site-directed through 
the use of particular targeting methods. 

A, Random Mutagenesis 
5 i) Insertional Mutagenesis 

Insertional mutagenesis is based on the inactivation of a gene via insertion of a 
known DNA fragment. Because it involves the insertion of some type of DNA fragment, 
the mutations generated are generally loss-of-function, rather than gain-of-function 
mutations. However, there are several examples of insertions generating gain-of-fiinction 

10 mutations (Oppenheimer et al. 1991). Insertion mutagenesis has been very successful in 
bacteria and Drosophila (Cooley et al. 1988) and recently has become a powerful tool in 
corn (Schmidt et al. 1987); Arabidopsis; (Marks et al., 1991; Koncz et al. 1990); and 
Antirrhinum (Sommer et al. 1990). 

Transposable genetic elements are DNA sequences that can move (transpose) from 

15 one place to another in the genome of a cell. The first transposable elements to be 
recognized were the Activator/Dissociation elements of Zea mays (McClintock, 1957). 
Since then, they have been identified in a wide range of organisms, both prokaryotic and 
eukaryotic. 

Transposable elements in the genome are characterized by being flanked by direct 
20 repeats of a short sequence of DNA that has been duplicated during transposition and is 
called a target site duplication. Virtually all transposable elements whatever their type, and 
mechanism of transposition, make such duplications at the site of their insertion. In some 
cases the number of bases duplicated is constant, in other cases it may vary with each 
transposition event. Most transposable elements have inverted repeat sequences at their 
25 termini, these terminal inverted repeats may be anything from a few bases to a few hundred 
bases long and in many cases they are known to be necessary for transposition. 

Prokaryotic transposable elements have been most studied in E. coli and Gram 
negative bacteria, but also are present in Gram positive bacteria. They are generally termed 
insertion sequences if they are less than about 2 kB long, or transposons if they are longer. 
30 Bacteriophages such as mu and D 1 08, which replicate by transposition, make up a third type 
of transposable element, elements of each type encode at least one polypeptide a 
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transposase, required for their own transposition. Transposons often further include genes 
coding for function unrelated to transposition, for example, antibiotic resistance genes. 

Transposons can be divided into two classes according to their structure. First, 
compound or composite transposons have copies of an insertion sequence element at each 
5 end, usually in an inverted orientation. These transposons require transposases encoded by 
one of their terminal IS elements. The second class of transposon have terminal repeats of 
about 30 base pairs and do not contain sequences from IS elements. 

Transposition usually is either conservative or replicative, although in some cases 
it can be both. In replicative transposition, one copy of the transposing element remains at 

10 the donor site, and another is inserted at the target site. In conservative transposition, the 
transposing element is excised from one site and inserted at another. Eukaryotic elements 
also can be classified according to their structure and mechanism of transportation. The 
primary distinction is between elements that transpose via an RNA intermediate, and 
elements that transpose directly from DNA to DNA. 

15 Elements that transpose via an RNA intermediate often are referred to as 

retrotransposons, and their most characteristic feature is that they encode polypeptides that 
are believed to have reverse transcriptionase activity. There are two types of retrotransposon. 
Some resemble the integrated proviral DNA of a retrovirus in that they have long direct 
repeat sequences, long terminal repeats (LTRs), at each end. The similarity between these 

20 retrotransposons and proviruses extends to their coding capacity. They contain sequences 
related to the gag and pol genes of a retrovirus, suggesting that they transpose by a 
mechanism related to a retroviral life cycle. Retrotransposons of the second type have no 
terminal repeats. They also code for gag- and pol-like polypeptides and transpose by reverse 
transcription of RNA intermediates, but do so by a mechanism that differs from that or 

25 retrovirus-like elements. Transposition by reverse transcription is a replicative process and 
does not require excision of an element from a donor site. 

Transposable elements are an important source of spontaneous mutations, and have 
influenced the ways in which genes and genomes have evolved. They can inactivate genes 
by inserting within them, and can cause gross chromosomal rearrangements either directly, 

30 through the activity of their transposases, or indirectly, as a result of recombination between 
copies of an element scattered around the genome. Transposable elements that excise often 



40 



WO 00/79264 PCT/US00/1 7493 

do so imprecisely and may produce alleles coding for altered gene products if the number 
of bases added or deleted is a multiple of three. 

Transposable elements themselves may evolve in unusual ways. If they were 
inherited like other DNA sequences, then copies of an element in one species would be more 
5 like copies in closely related species than copies in more distant species. This is not always 
the case, suggesting that transposable elements are occasionally transmitted horizontally 
from one species to another. 

ii) Chemical mutagenesis 

10 Chemical mutagenesis offers certain advantages, such as the ability to find a full 

range of mutant alleles with degrees of phenotypic severity, and is facile and inexpensive 
to perform. The majority of chemical carcinogens produce mutations in DNA. 
Benzo[a]pyrene, N-acetoxy-2-acetyl aminofluorene and aflotoxin Bl cause GC to TA 
transversions in bacteria and mammalian cells. Benzo[a]pyrene also can produce base 

15 substitutions such as AT to TA. N-nitroso compounds produce GC to AT transitions. 
Alkylation of the 04 position of thymine induced by exposure to n-nitrosoureas results in 
TA to CG transitions. 

A high correlation between mutagenicity and carcinogenity is the underlying 
assumption behind the Ames test (McCann et al., 1975) which speedily assays for mutants 

20 in a bacterial system, together with an added rat liver homogenate, which contains the 
microsomal cytochrome P450, to provide the metabolic activation of the mutagens where 
needed. 

In vertebrates, several carcinogens have been found to produce mutation in the ras 
proto-oncogene. N-nitroso-N-methyl urea induces mammary, prostate and other carcinomas 
25 in rats with the majority of the tumors showing a G to A transition at the second position in 
codon 12 of the Ha-ras oncogene. Benzo[a]pyrene-induced skin tumors contain A to T 
transformation in the second codon of the Ha-ras gene. 

iii) Radiation Mutagenesis 

30 The integrity of biological molecules is degraded by the ionizing radiation. 

Adsorption of the incident energy leads to the formation of ions and free radicals, and 
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breakage of some covalent bonds. Susceptibility to radiation damage appears quite variable 
between molecules, and between different crystalline forms of the same molecule. It 
depends on the total accumulated dose, and also on the dose rate (as once free radicals are 
present, the molecular damage they cause depends on their natural diffusion rate and thus 
5 upon real time). Damage is reduced and controlled by making the sample as cold as 
possible. 

Ionizing radiation causes DNA damage and cell killing, generally proportional to the 
dose rate. Ionizing radiation has been postulated to induce multiple biological effects by 
direct interaction with DNA, or through the formation of free radical species leading to DNA 

1 0 damage (Hall, 1988). These effects include gene mutations, malignant transformation, and 
cell killing. Although ionizing radiation has been demonstrated to induce expression of 
certain DNA repair genes in some prokaryotic and lower eukaryotic cells, little is known 
about the effects of ionizing radiation on the regulation of mammalian gene expression 
(Borek, 1985). Several studies have described changes in the pattern of protein synthesis 

15 observed after irradiation of mammalian cells. For example, ionizing radiation treatment 
of human malignant melanoma cells is associated with induction of several unidentified 
proteins (Boothman et al., 1989). Synthesis of cyclin and co-regulated polypeptides is 
suppressed by ionizing radiation in rat REF52 cells, but not in oncogene-transformed REF52 
cell lines (Lambert and Borek, 1988). Other studies have demonstrated that certain growth 

20 factors or cytokines may be involved in x-ray-induced DNA damage. In this regard, 
platelet-derived growth factor is released from endothelial cells after irradiation (Witte, et 
al., 1989). 

In the present invention, the term "ionizing radiation" means radiation comprising 
particles or photons that have sufficient energy or can produce sufficient energy via nuclear 

25 interactions to produce ionization (gain or loss of electrons). An exemplary and preferred 
ionizing radiation is an x-radiation. The amount of ionizing radiation needed in a given cell 
generally depends upon the nature of that cell. Typically, an effective expression-inducing 
dose is less than a dose of ionizing radiation that causes cell damage or death directly. 
Means for determining an effective amount of radiation are well known in the art. 

30 In a certain embodiments, an effective expression inducing amount is from about 2 

to about 30 Gray (Gy) administered at a rate of from about 0.5 to about 2 Gy/minute. Even 
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more preferably, an effective expression inducing amount of ionizing radiation is from about 
5 to about 15 Gy. In other embodiments, doses of 2-9 Gy are used in single doses. An 
effective dose of ionizing radiation may be from 10 to 100 Gy, with 15 to 75 Gy being 
preferred, and 20 to 50 Gy being more preferred. 
5 Any suitable means for delivering radiation to a tissue may be employed in the 

present invention in addition to external means. For example, radiation may be delivered 
by first providing a radiolabeled antibody that immunoreacts with an antigen of the tumor, 
followed by delivering an effective amount of the radiolabeled antibody to the tumor. In 
addition, radioisotopes may be used to deliver ionizing radiation to a tissue or cell. 

10 

iv) In Vitro Scanning Mutagenesis 

Random mutagenesis also may be introduced using error prone PCR (Cadwell and 
Joyce, 1992). The rate of mutagenesis may be increased by performing PCR in multiple 
tubes with dilutions of templates. 

15 One particularly useful mutagenesis technique is alanine scanning mutagenesis in 

which a number of residues are substituted individually with the amino acid alanine so that 
the effects of losing side-chain interactions can be determined, while minimizing the risk of 
large-scale perturbations in protein conformation (Cunningham et al., 1989). 

In recent years, techniques for estimating the equilibrium constant for ligand binding 

20 using minuscule amounts of protein have been developed (Blackburn et aL, 1991; U.S. 
Patents 5,221,605 and 5,238,808). The ability to perform functional assays with small 
amounts of material can be exploited to develop highly efficient, in vitro methodologies for 
the saturation mutagenesis of antibodies. The inventors bypassed cloning steps by 
combining PCR mutagenesis with coupled in vitro transcription/translation for the high 

25 throughput generation of protein mutants. Here, the PCR products are used directly as the 
template for the in vitro transcription/translation of the mutant single chain antibodies. 
Because of the high efficiency with which all 19 amino acid substitutions can be generated 
and analyzed in this way, it is now possible to perform saturation mutagenesis on numerous 
residues of interest, a process that can be described as in vitro scanning saturation 

30 mutagenesis (Burks et al., 1997). 
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In vitro scanning saturation mutagenesis provides a rapid method for obtaining a 
large amount of structure-function information including: (i) identification of residues that 
modulate ligand binding specificity, (ii) a better understanding of ligand binding based on 
the identification of those amino acids that retain activity and those that abolish activity at 
a given location, (iii) an evaluation of the overall plasticity of an active site or protein 
subdomain, (iv) identification of amino acid substitutions that result in increased binding. 

v) Random Mutagenesis by Fragmentation and Reassembly 

A method for generating libraries of displayed polypeptides is described in U.S. 
Patent 5,380,721. The method comprises obtaining polynucleotide library members, 
pooling and fragmenting the polynucleotides, and reforming fragments therefrom, 
performing PCR amplification, thereby homologously recombining the fragments to form 
a shuffled pool of recombined polynucleotides. 

B. Site-Directed Mutagenesis 

Structure-guided site-specific mutagenesis represents a powerful tool for the 
dissection and engineering of protein-ligand interactions (Wells, 1 996, Braisted et al., 1 996). 
The technique provides for the preparation and testing of sequence variants by introducing 
one or more nucleotide sequence changes into a selected DNA. 

Site-specific mutagenesis uses specific oligonucleotide sequences which encode the 
DNA sequence of the desired mutation, as well as a sufficient number of adjacent, 
unmodified nucleotides. In this way, a primer sequence is provided with sufficient size and 
complexity to form a stable duplex on both sides of the deletion junction being traversed. 
A primer of about 17 to 25 nucleotides in length is preferred, with about 5 to 10 residues on 
both sides of the junction of the sequence being altered. 

The technique typically employs a bacteriophage vector that exists in both a 
single-stranded and double-stranded form. Vectors useful in site-directed mutagenesis 
include vectors such as the Ml 3 phage. These phage vectors are commercially available and 
their use is generally well known to those skilled in the art. Double-stranded plasmids are 
also routinely employed in site-directed mutagenesis, which eliminates the step of 
transferring the gene of interest from a phage to a plasmid. 
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In general, one first obtains a single-stranded vector, or melts two strands of a 
double-stranded vector, which includes within its sequence a DNA sequence encoding the 
desired protein or genetic element. An oligonucleotide primer bearing the desired mutated 
sequence, synthetically prepared, is then annealed with the single-stranded DNA 
5 preparation, taking into account the degree of mismatch when selecting hybridization 
conditions. The hybridized product is subjected to DNA polymerizing enzymes such as E. 
coli polymerase I (Klenow fragment) in order to complete the synthesis of the 
mutation-bearing strand. Thus, a heteroduplex is formed, wherein one strand encodes the 
original non-mutated sequence, and the second strand bears the desired mutation. This 
1 0 heteroduplex vector is then used to transform appropriate host cells, such as E. coli cells, and 
clones are selected that include recombinant vectors bearing the mutated sequence 
arrangement. 

Comprehensive information on the functional significance and information content 
of a given residue of protein can best be obtained by saturation mutagenesis in which all 19 

15 amino acid substitutions are examined. The shortcoming of this approach is that the 
logistics of multiresidue saturation mutagenesis are daunting (Warren et al 9 1996, Brown 
et aU 1996; Zeng et aL, 1996; Burton and Barbas, 1994; Yelton et al 9 1995; Jackson et al 9 
1995; Short et aU 1995; Wong et al 9 1996; Hilton et al 9 1996). Hundreds, and possibly 
even thousands, of site specific mutants must be studied. However, improved techniques 

20 make production and rapid screening of mutants much more straightforward. See also, U.S. 
Patents 5,798,208 and 5,830,650, for a description of ' Valk-through" mutagenesis. 

Other methods of site-directed mutagenesis are disclosed in U.S. Patents 5,220,007; 
5,284,760; 5,354,670; 5,366,878; 5,389,514; 5,635,377; and 5,789,166. 

25 Methods of Making Transgenic Mice 

A particular embodiment of the present invention provides transgenic animals that 
contain the knockin/fusion constructs of interest. In a general aspect, a transgenic animal 
is produced by the integration of a given transgene into the genome in a manner that permits 
30 the expression of the transgene. Methods for producing transgenic animals are generally 
described by Wagner and Hoppe (U.S. Patent 4,873,191; which is incorporated herein by 



45 



WO 00/79264 



PCT/US00/17493 



reference), Brinster et al. 1985; which is incorporated herein by reference in its entirety) and 
in "Manipulating the Mouse Embryo; A Laboratory Manual" 2nd edition (eds., Hogan, 
Beddington, Costantimi and Long, Cold Spring Harbor Laboratory Press, 1994; which is 
incorporated herein by reference in its entirety). 
5 Typically, a gene flanked by genomic sequences is transferred by microinj ection into 

a fertilized egg. The microinj ected eggs are implanted into a host female, and the progeny 
are screened for the expression of the transgene. Transgenic animals may be produced from 
the fertilized eggs from a number of animals including, but not limited to reptiles, 
amphibians, birds, mammals, and fish. 

1 0 DNA clones for microinj ection can be prepared by any means known in the art. For 

example, DNA clones for microinjection can be cleaved with enzymes appropriate for 
removing the bacterial plasmid sequences, and the DNA fragments electrophoresed on 1% 
agarose gels in TBE buffer, using standard techniques. The DNA bands are visualized by 
staining with ethidium bromide, and the band containing the expression sequences is 

1 5 excised. The excised band is then placed in dialysis bags containing 0.3 M sodium acetate, 
pH 7.0. DNA is electroeluted into the dialysis bags, extracted with a 1 : 1 phenolrchloroform 
solution and precipitated by two volumes of ethanol. The DNA is redissolved in 1 ml of low 
salt buffer (0.2 M NaCl, 20 mM Tris,pH 7.4, and 1 mM EDTA) and purified on an 
Elutip-D™column. The column is first primed with 3 ml of high salt buffer (1 M NaCl, 20 

20 mM Tris, pH 7.4, and 1 mM EDTA) followed by washing with 5 ml of low salt buffer. The 
DNA solutions are passed through the column three times to bind DNA to the column 
matrix. After one wash with 3 ml of low salt buffer, the DNA is eluted with 0.4 ml high salt 
buffer and precipitated by two volumes of ethanol. DNA concentrations are measured by 
absorption at 260 nm in a UV spectrophotometer. For microinjection, DNA concentrations 

25 are adjusted to 3 mg/ml in 5 mM Tris, pH 7.4 and 0. 1 mM EDTA. 

Other methods for purification of DNA for microinjection are described in Hogan 
et al. Manipulating the Mouse Embryo (Cold Spring Harbor Laboratory, Cold Spring 
Harbor, NY, 1986), in Palmiter et al. Nature 300:611 (1982); in The Qiagenologist, 
Application Protocols, 3rd edition, published by Qiagen, Inc., Chatsworth, CA.; and in 

30 Sambrook et al. Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Laboratory, 
Cold Spring Harbor, NY, 1989). 
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In an exemplary microinjection procedure, female mice six weeks of age are induced 
to superovulate with a 5 IU injection (0.1 cc, ip) of pregnant mare serum gonadotropin 
(PMSG; Sigma) followed 48 hours later by a 5 IU injection (0.1 cc, ip) of human chorionic 
gonadotropin (hCG; Sigma). Females are placed with males immediately after hCG 
5 injection. Twenty-one hours after hCG injection, the mated females are sacrificed by C02 
asphyxiation or cervical dislocation and embryos are recovered from excised oviducts and 
placed in Dulbecco's phosphate buffered saline with 0.5% bovine serum albumin (BSA; 
Sigma). Surrounding cumulus cells are removed with hyaluronidase(l mg/ml). Pronuclear 
embryos are then washed and placed in Earle's balanced salt solution containing 0.5 % BSA 

10 (EBSS) in a 37.5°C incubator with a humidified atmosphere at 5% C02, 95% air until the 
time of injection. Embryos can be implanted at the two-cell stage. 

Randomly cycling adult female mice are paired with vasectomized males. C57BL/6 
or Swiss mice or other comparable strains can be used for this purpose. Recipient females 
are mated at the same time as donor females. At the time of embryo transfer, the recipient 

15 females are anesthetized with an intraperitoneal injection of 0.015 ml of 2.5 % avertin per 
gram of body weight. The oviducts are exposed by a single midline dorsal incision. An 
incision is then made through the body wall directly over the oviduct. The ovarian bursa is 
then torn with watchmakers forceps. Embryos to be transferred are placed in DPBS 
(Dulbecco's phosphate buffered saline) and in the tip of a transfer pipet (about 10 to 12 

20 embryos). The pipet tip is inserted into the infundibulum and the embryos transferred. 
After the transfer, the incision is closed by two sutures. 

Dosage and Formulation 

The compounds (active ingredients) of this invention can be formulated and 
25 administered to treat a variety of disease states by any means that produces contact of the 
active ingredient with the agent's site of action in the body of a mammal. They can be 
administered by any conventional means available for use in conjunction with 
pharmaceuticals, either as individual therapeutic active ingredients or in a combination of 
therapeutic active ingredients. They can be administered alone, but are generally 
30 administered with a pharmaceutical carrier selected on the basis of the chosen route of 
administration and standard pharmaceutical practice. 
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The dosage administered will be a therapeutically effective amount of active 
ingredient and will, of course, vary depending upon known factors such as the 
pharmacodynamic characteristics of the particular active ingredient and its mode and route 
of administration; age, sex, health and weight of the recipient; nature and extent of 
5 symptoms; kind of concurrent treatment, frequency of treatment and the effect desired. 

The active ingredient can be administered orally in solid dosage forms such as 
capsules, tablets and powders, or in liquid dosage forms such as elixirs, syrups, emulsions 
and suspensions. The active ingredient can also be formulated for administration 
parenterally by injection, rapid infusion, nasopharyngeal absorption or dermoabsorption. 

10 The agent may be administered intramuscularly, intravenously, or as a suppository. 

Gelatin capsules contain the active ingredient and powdered carriers such as lactose, 
sucrose, mannitol, starch, cellulose derivatives, magnesium stearate, stearic acid, and the 
like. Similar diluents can be used to make compressed tablets. Both tablets and capsules 
can be manufactured as sustained release products to provide for continuous release of 

1 5 medication over a period of hours. Compressed tablets can be sugar coated or film coated 
to mask any unpleasant taste and protect the tablet from the atmosphere, or enteric coated 
for selective disintegration in the gastrointestinal tract. 

Liquid dosage forms for oral administration can contain coloring and flavoring to 
increase patient acceptance. 

20 In general, water, a suitable oil, saline, aqueous dextrose (glucose), and related sugar 

solutions and glycols such as propylene glycol or polyethylene glycols are suitable carriers 
for parenteral solutions. Solutions for parenteral administration contain preferably a water 
soluble salt of the active ingredient, suitable stabilizing agents and, if necessary, buffer 
substances. Antioxidizing agents such as sodium bisulfate, sodium sulfite or ascorbic acid, 

25 either alone or combined, are suitable stabilizing agents. Also used are citric acid and its 
salts and sodium Ethylenediaminetetraacetic acid (EDTA). In addition, parenteral solutions 
can contain preservatives such as benzalkonium chloride, methyl- or propyl-paraben and 
chlorobutanol. Suitable pharmaceutical carriers are described in Remington's 
Pharmaceutical Sciences, a standard reference text in this field. 

30 Additionally, standard pharmaceutical methods can be employed to control the 

duration of action. These are well known in the art and include control release preparations 
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and can include appropriate macromolecules, for example polymers, polyesters, polyamino 
acids, polyvinyl, pyrolidone, ethylenevinylacetate, methyl cellulose, carboxymethyl 
cellulose or protamine sulfate. The concentration of macromolecules as well as the methods 
of incorporation can be adjusted in order to control release. Additionally, the agent can be 
5 incorporated into particles of polymeric materials such as polyesters, polyamino acids, 
hydrogels, poly (lactic acid) or ethylenevinylacetate copolymers. In addition to being 
incorporated, these agents can also be used to trap the compound in microcapsules. 

Useful pharmaceutical dosage forms for administration of the compounds of this 
invention can be illustrated as follows. 
10 Capsules: Capsules are prepared by filling standard two-piece hard gelatin 

capsulates each with 100 milligram of powdered active ingredient, 175 milligrams of 
lactose, 24 milligrams of talc and 6 milligrams magnesium stearate. 

Soft Gelatin Capsules: A mixture of active ingredient in soybean oil is prepared and 
injected by means of a positive displacement pump into gelatin to form soft gelatin capsules 
1 5 containing 1 00 milligrams of the active ingredient. The capsules are then washed and dried. 

Tablets: Tablets are prepared by conventional procedures so that the dosage unit is 
100 milligrams of active ingredient. 0.2 milligrams of colloidal silicon dioxide, 5 
milligrams of magnesium stearate, 275 milligrams of microcrystalline cellulose, 11 
milligrams of cornstarch and 98.8 milligrams of lactose. Appropriate coatings may be 
20 applied to increase palatability or to delay absorption. 

Injectable: A parenteral composition suitable for administration by injection is 
prepared by stirring 1.5% by weight of active ingredients in 10% by volume propylene 
glycol and water. The solution is made isotonic with sodium chloride and sterilized. 

Suspension: An aqueous suspension is prepared for oral administration so that each 
25 5 millimeters contain 100 milligrams of finely divided active ingredient, 200 milligrams of 
sodium carboxymethyl cellulose, 5 milligrams of sodium benzoate, 1.0 grams of sorbitol 
solution U.S.P. and 0.025 millimeters of vanillin. 

Accordingly, the pharmaceutical composition of the present invention may be 
delivered via various routes and to various sites in an animal body to achieve a particular 
30 effect (see, e.g., Rosenfeld et al. (1991), supra; Rosenfeld et al., Clin. Res., 39(2), 31 1A 
(1991a); Jaffe et al., supra; Berkner, supra). One skilled in the art will recognize that 
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although more than one route can be used for administration, a particular route can provide 
a more immediate and more effective reaction than another route. Local or systemic delivery 
can be accomplished by administration comprising application or instillation of the 
formulation into body cavities, inhalation or insufflation of an aerosol, or by parenteral 
5 introduction, comprising intramuscular, intravenous, peritoneal, subcutaneous, intradermal, 
as well as topical administration. 

The composition of the present invention can be provided in unit dosage form 
wherein each dosage unit, e.g., a teaspoonful, tablet, solution, or suppository, contains a 
predetermined amount of the composition, alone or in appropriate combination with other 

1 0 active agents. The term "unit dosage form" as used herein refers to physically discrete units 
suitable as unitary dosages for human and animal subjects, each unit containing a 
predetermined quantity of the compositions of the present invention, alone or in combination 
with other active agents, calculated in an amount sufficient to produce the desired effect, in 
association with a pharmaceutical^ acceptable diluent, carrier, or vehicle, where 

1 5 appropriate. The specifications for the unit dosage forms of the present invention depend on 
the particular effect to be achieved and the particular pharmacodynamics associated with the 
pharmaceutical composition in the particular host. 

These methods described herein are by no means all-inclusive, and further methods 
to suit the specific application will be apparent to the ordinary skilled artisan. Moreover, the 

20 effective amount of the compositions can be further approximated through analogy to 
compounds known to exert the desired effect. 

The following examples are offered by way of example, and are not intended to limit 
the scope of the invention in any manner. 

25 

EXAMPLE 1 

Placing the agouti cDNA Under the Control of an Imprinted Promoter 

30 The Snrpn gene (SEQ ID NO: 12) contains 1 0 exons within about 22 kb of genomic 

DNA (Shemer et al., 1997), and encodes two open reading frames, both expressed 
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exclusively from the paternal chromosome. A small open reading frame potentially encodes 
a 71 amino acid protein of unknown function and extends from exon 1 to within exon 3. A 
large open reading frame encodes the 240 amino acid SmN protein and is initiated in exon 
4. The agouti cDNA (SEQ ID NO: 13) was introduced into the Snrpn locus in mice by 
5 homologous recombination in embryonic stem (ES) cells. A replacement type vector with 
positive/negative selection markers was constructed to insert the agouti cDNA into the 
EcoRI site of exon 3 of the Snrpn gene leaving a 10-kb genomic region of potential 
regulatory significance flanking upstream of the knockin site (Figurela). 

Construction of the the targeting vector was as follows. Overlapping lambda phage 

10 clones containing the Snrpn gene were isolated from a mouse 129Sv/Ev genomic library. 
To construct the agouti knockin vector, a 0.58 kb Clal-BamHl fragment containing the 
agouti cDNA was eluted from plasmid p284. This fragment and a 0.46-kb Bgni-HindUl 
fragment containing the rabbit P-globin polyadenylation signal were cloned into the 
pBluescript vector (Stratagene, La Jolla, CA). A neomycin expression cassette with the RNA 

15 polymerase II promoter (Soriano et al., 1991) was then introduced downstream of the 
polyadenylation signal. The final targeting vector consisted of a 3 .4 kb EcoRl genomic DNA 
fragment as the 5' homology region and a 2.7 kb Sail (one end from polylinker of phage 
clone) genomic DNA fragment as the 3' homology region flanking the agouti and neomycin 
cassettes. The HSV-tk expression cassette with the rat p-actin promoter (McMahon and 

20 Bradley, 1990) was subsequently ligated downstream of the 3' homology region. The 
targeting vector was linearized at a unique Sail site outside of the 5' region of homology, 
leaving the plasmid backbone attached to the HSV-tk cassette. 

EXAMPLE 2 

25 

Generation of agouti knockin mice 

Introduction of the agouti cDNA and the rabbit p-globin polyadenylation signal into 
the exon 3 of the Snrpn gene terminates the Snrpn/agouti fusion RNA before the large open 
3 0 reading frame, although the large open reading frame is potentially intact. The Snrpn/ agouti 
fusion RNA contains the complete small open reading frame of the Snrpn gene followed by 
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the agouti open reading frame. The targeting vector containing a neomycin expression 
cassette for positive selection and the HS V-tk gene for negative selection was linearized, and 
15 |iig were electroporated into AB2.2 ES cells derived from the 129Sv/Ev mouse strain. 
Geneticin (G418 sulfate, 200 |ag/mL) and FIAU (0.2 jaM) selection was applied 24 hours 
5 after plating and maintained for 7-8 days. Colonies resistant to G418 and FIAU were 
picked, trypsinized, and seeded onto a feeder layer of mitotically inactivated STO cells in 
96-well plates. Correctly targeted G4 18 -resistant clones were identified by Southern blot 
analysis using 5' and 3' flanking probes shown in Figure la to assure that homologous 
recombination had occurred at both ends. The 5' flanking probe was a 0.35 kb Xbal-Sacl 

10 fragment from intron 1 of the Snrpn gene, and the 3' flanking probe was a 1.2-kb Hindlll 
fragment from intron 3 of the Snrpn gene. Sixteen targeted clones were identified after 
screening 96 G4 18 -resistant colonies. The targeted ES cells were injected into C57BL/6J 
blastocysts. Male chimeras were bred with C57BL/6J females, and germline transmission 
of the Swrprc/agouti fusion transgene was obtained (Figurelb). The heterozygous progeny 

15 with a 129Sv/Ev and C57BL/6J mixed background (Figure 2a, generation I) were further 
crossed with C57BL/6J mice to transfer the agouti transgene onto the homozygous 
nonagouti (a/a) background (Fig.2a, generation II) as determined by Southern blot analysis 
to distinguish the agouti A and nonagouti a alleles (Vrieling et al., 1 998). Only mice of a/a 
genotype were selected for further study. 

20 

EXAMPLE 3 

The agouti knockin/fusion sequence yields an imprinted coat color effect 

25 The effect of the agouti knockin/fusion sequence on coat color is demonstrated in 

Figure 3. When the agouti knockin/fusion sequence was inherited from a heterozygous 
female, the offspring carrying the knockin/fusion sequence were entirely black and were 
indistinguishable from nonagouti a/a littermates (Figure 3b). In contrast, when the agouti 
knockin/fusion sequence was inherited from a heterozygous male, the offspring carrying the 

3 0 knockin/fusion sequence demonstrated a distinctly lighter tannish abdominal coloration that 
was clearly different from the completely black, nonagouti a/a littermates that did not inherit 
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a knockin/fusion sequence (Figure 3 a). The ventral coat color of mice carrying the imprinted 
agouti knockin/fusion sequence switches back and forth in both directions through 
gametogenesis. Breeding of the male homozygous for the knockin/fusion sequence with 
C57BL/6J females (nonagouti a/a) gave 1 00% of the offspring with tannish ventrum. About 
5 half (48.1%) of the offspring have tannish ventrum from the breeding of a heterozygous 
male carrying the maternally inherited agouti knockin/fusion sequence with C57BL/6J 
(nonagouti a/a) females, indicating switching of the agouti knockin/fusion sequence from 
a maternal methylated to paternal unmethylated pattern during gametogenesis accompanied 
reexpression of the knockin/fusion sequence. Conversely, for the reciprocal cross using 

10 homozygous or heterozygous females carrying the paternally inherited knockin/fusion 
sequence bred with C57BL/6J (nonagouti a/a) males, 0% of the offspring showed tannish 
ventrum, indicating that the actively expressed paternal knockin/fusion sequence was 
switched to a maternal expression pattern and silenced during gametogenesis in the female. 
In summary, 68 mice inheriting the knockin/fusion sequence paternally had a tannish 

15 abdomen, while 69 mice inheriting the knockin/fusion sequence maternally had a black 
abdomen. 

EXAMPLE 4 

20 Switching of methylation pattern is preserved 

There is a differentially methylated region at the 5' end of the mouse Snrpn gene 
extending about 6 kb from promoter to intron 1, including the Snrpn CpG island, which 
correlates inversely with Snrpn expression (Shemer et aL, 1997). The maternal allele of the 

25 Snrpn gene is methylated at the CpG island and silenced, while the paternal allele is 
unmethylated and actively expressed. Figure 2 shows an analysis of the methylation status 
of the 5' differentially methylated sites (Sacll and BssHIl within intron 1 of the Snrpn gene; 
(Figure la) in three generations of mice carrying the agouti knockin/fiision sequence. 
Genomic DNA was isolated by the proteinase K/SDS digestion and phenol/chloroform 

30 extraction method (Sambrooket al., 1989). Genomic DNA(lO^g) from mice 1-1,11-2, and 
III-2 (Figure 2a) as well as the 129Sv/Ev control was digested overnight with Hindlll alone 
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or in combination with methylati on-sensitive enzymes, Sacll or BssHll. Digested DNA was 
transferred from 0.8% of agarose gels to Hybond N+ membrane (Amersham, Newark, NJ) 
as described (Sambrook et al., 1989) for Southern blot analysis. The probe was a 1.3-kb 
BssHll-EcoRl fragment from the Snrpn intron 1 region. Radiolabeled DNA probe was 
5 synthesized using the random hexamer method (Zoon, 1987), and hybridized in a buffer 
containing 0.25 M NaP0 4 pH 7, 0.25 M NaCl, 1 mM EDTA, 1 0% PEG-8000, 7%SDS, and 
1 % bovine serum albumin. Probes containing repetitive sequences were preassociated using 
an excess of mouse DNA to quench repeat hybridization (Sealey et al., 1985). Filters were 
washed to a final stringency of 0.5X SSC/0.1%SDS at 65 C. Autoradiography was 

10 performed at -80 C with intensifying screen and Kodak XA-R film. 

As shown in Figure 2b, the 12-kb Hindlll genomic fragment was completely 
digested by Sacll and i?ssHII on the paternally inherited agouti knockin/fusion sequence in 
mouse 1-1, while the maternally inherited wild type Snrpn allele was methylated and 
resistant to digestion with the methylation-sensitive enzymes (Figure 2b). In contrast, when 

1 5 the paternally inherited agouti knockin/fusion sequence was transmitted through a female, 
the agouti knockin/fusion sequence became fully methylated (Figure 2b, mouse II-2), 
indicating that there was a switch of the methylation pattern of Snrpn CpG island from a 
paternal to maternal state. When the maternally inherited agouti knockin/fusion sequence 
was further transmitted through a male, the methylation of the Snrpn CpG island switched 

20 back to the paternal unmethylated status (Figure 2b,mouse III-2). Thus, the differential 
methylation of the Snrpn CpG island switches from a paternal to the maternal methylated 
states or maternal to the paternal unmethylated state as would be expected in normal mice. 

EXAMPLE 5 

25 

The imprinting of expression is retained for the mutant allele 

RT-PCR analysis was used to examine the RNA expression of the wild type Snrpn 
allele and the Snrpn/ agouti knockin/fusion sequence. Total RNA was isolated using the 
30 guanidiniumthiocyanate/CsCl gradient method (Sambrook etal, 1989). For RT-PCR analysis, 
DNasel-treated total RNA was reverse transcribed using primer p3 (5'- 
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CTGTTCCACAATAGCCGTTGTC-3'; SEQIDNO;l) for the Snrpn wild type transcripts, 
p5 (5'-GCGAGTTCATGGAGGAGTTACTCC-3'; SEQ ID NO:2) for the Snrpn/ 'agouti 
knockin/fusion sequence, primer Neo-3 (S'-TGATGCTCTTCGTCCAGATCATCC^'; SEQ 
ID NO:3) for neomycin transgene, and primer HPRT-3 
5 (5'-CTTTCCAGTTAAAGTTGAGAGATC-3'; SEQ ID NO:4) for the HPRT gene. All PCR 
was carried out in 50 jliI reactions containing 10 mM Tris pH8.3, 1.5 mM MgC12, 50 mM 
KC1 5 200 |LiM dNTPs, 1 |xM primers, and 2.5 jil of RT reaction with cycling conditions of 94° 
C for 1 min, 59° C for 1 min, and 72° C for 1 min for 30 cycles. PCR products were analyzed 
on a 1 .2% agarose gel. The PCR primer sequences are as follows: for Snrpn wild type allele, 

10 pi (forward) 5'-TTGGTTCTGAGGAGTGATTTGC-3' (SEQ ID NO:5), and p2 (reverse) 
S'-CCTTGAATTCCACCACCTTG-S' (SEQ ID NO:6); for Snrpn/ agouti knockin/fiision 
sequence, pi (forward, listed above), and p4 (reverse) 5'- 
GTGGACGGTGAAGAAGCACAGGAAG-3' (SEQ ID NO:7); for neomycin transgene, Neo-1 
(forward) S'-CTTTTTGTCAAGACCGACCTGTCCG-S' (SEQIDNO:8), andNeo-2 (reverse) 

15 5 / -CTCGATGCGATGTTTCGCTTGGTG-3 / (SEQ ID NO:9); for HPRT internal control, 
HPRT-1 (forward) 5 , -ATGACCTAGATTTGTTTTGTATACC-3 / (SEQ ID NO: 10), andHPRT-2 
(reverse) 5'-GTAGCTCTTCAGTCTGATAAAATCTAC-3'(SEQ ID NO: 11). 

Figure 4 shows that only the paternally inherited transcript (either the Snrpn wild type 
or S>zrp/2/agouti knockin/fiision sequence) is present in mice heterozygous for the knockin/fiision 

20 sequence. The Snrpn/ agouti knockin/fusion sequence RNA was expressed in the skin of mice 
I- 1 and III- 1 (see pedigree in Figure 2a) carrying the paternally inherited agouti knockin/fusion 
sequence. When the knockin/fusion sequence was maternally inherited, the Snrpn/agouti 
knockin/fusion sequence was silenced, and the Snrpn RNA was expressed from the paternal 
wild type allele (Figure 4, mouse II-2). Imprinted expression of the Snrpn/agouti knockin/fusion 

25 sequence was also detected using RNA from the brain of mouse III- 1 . RNA containing the 
large open reading frame of Snrpn from the agouti knockin/fusion sequence was not detected 
by RT-PCR analysis of Snrpn exon 7. Therefore, the RT-PCR analysis shows that RNA is 
expressed from the paternal unmethylated chromosome but not from the maternal methylated 
chromosome. Taken together, the results demonstrate coat color variation based on imprinted 

30 regulation of the agouti cDNA under the control of the Snrpn promoter has been achieved. 
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EXAMPLE 6 

Demonstration of the Feasability of a Screen for Identification of Genetic Alterations 
That Affect Genomic Imprinting of the Snrpn Promoter 

5 

To further dissect the regulatory mechanisms controlling genomic imprinting, a novel 
strategy has been developed to use mutagenesis in the mouse to identify mutations affecting 
genomic imprinting. Towards this goal, a coat color marker has been placed under the control 
of an imprinted promoter so that mutations affecting imprinting can be detected through coat 
1 0 color variation. 

Genomic imprinting on human chromosome 1 5q 1 1 -ql 3 and the homologous region 
of mouse chromosome 7c as related to Prader Willi and Angelman syndrome, two mental 
retardation conditions affecting humans, was studied. The mouse and human genomes for 
this region show considerable similarity. Mutations affecting genomic imprinting are isolated. 

1 5 The Snrpn promoter in the mouse is used in the isolation procedure. This promoter lies within 
a CpG island with methylation and silencing of the maternal chromosome as opposed to absence 
of methylation and expression on the paternal chromosome. Agouti cDNA was introduced 
using homologous recombination in embryonic stem (ES) cells, and placed under the control 
of the Snrpn promoter. This knockin/fusion sequence was transmitted to the mouse germline. 

20 Coat-color variation was used as an easily observable marker to identify genetic alterations 
that affect imprinting of the Snrpn promoter. In non-agouti (a/a) mice, paternal inheritance 
of the knockin/fiision sequence conferred a tannish abdominal color; the abdomen was black 
if the knockin/fiision sequence was absent or was inherited maternally when using appropriate 
genotypes. The differential methylation and imprinted expression of the Snrpn promoter was 

25 preserved, and imprinting was also preserved when tested at an RNA level. The feasibility 
of using a knockin/fiision sequence approach for screening in the mouse was demonstrated 
using two breeding schemes. The two breeding schemes are depicted in Figures 5a and 5b. 
In breeding scheme 1 , a male mouse homozygous for the knockin/fusion sequence and a/a 
were mutagenized (treated with ethylnitrosourea (ENU)) and bred to a/a wild type female 

30 mouse. All offspring received the knockin/fusion sequence from the mutagenized male parent 
and had a tannish abdomen. Mutagenesis of mice was performed by methods well known 
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in the art, such as described in Justice et al. , 1 999 and Weber et al. , 2000 . The types of mutations 
that might be found are discussed herein. 

Results of mutagenesis through May 3 0, 2000 are indicated in Figure 5 , with recovery 
of one mutant in 1 832 mice screened with ENU mutagenesis of the wild type male and 0 out 
5 of 3 77 mice screened with mutagenesis of the homozygous transgenic male. The mutagenized 
male had been mated with a female homozygous for the fusion transgene, and all offspring 
have a black abdomen unless a mutation occurs. The recovered mutant with a tan abdomen 
is depicted in Figure 6. 

This mutant mouse lacks methylation of the maternally inherited knockin/fusion gene 

10 in the mutant mouse, as shown in Figure 7 . Southern blotting was performed using the same 
probe and restriction enzyme combination as for Figure 2 by standard methods in the art. 
Tail DNA was isolated and digested with#mdIII (H) or J7z«dIII+ Sacll (H/S) as indicated. 
Digestion with Hin&lll gives a larger fragment for the wild type chromosome and a smaller 
fragment for the knockin/fusion allele, and combined digestion with Hindlll and methylation- 

15 sensitive Sacll gives clear indication as to whether a particular allele is methylated or 
unmethylated. Six mice were analyzed including the parental mice (ENU mutagenized wild 
type male and homozygous transgenic female) and four offspring (three nonmutant littermates 
and the mutant animal shown as a solid symbol in that the far right). The data for Snrpn 
demonstrate that the male parent is homozygous wild type and has one methylated and one 

20 unmethylated chromosome. The female parent is homozygous mutant with again one methylated 
and one unmethylated chromosome. The three nonmutant littermates have the genotype as 
expected, all being heterozygous for the knockin mutation and having the wild type allele 
unmethylated and the mutant allele methylated. The mutant animal is again heterozygous 
for the knockin mutation and a normal allele, but in this case both alleles are unmethylated 

25 and digested by Sacll. This confirms that the mutant animal definitively has a molecular 
abnormality involving imprinting of the Snrpn locus. Studies at additional loci indicate that 
there was also a partial alteration in the methylation at the necdin (Ndn) locus. For all of the 
animals except the mutant, the H/S digestion gives two bands of equal intensity. In the mutant 
animal the lower band is much more intense and the upper band less intense, indicating that 

30 there is partial demethylation of the necdin locus, which is approximately 1 megabase away 
on the same chromosome. The H 1 9 gene is located at a great distance on the same chromosome 
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and is part of a different imprinted gene cluster and shows no abnormality of methylation. 
Similarly the IgF2R locus shows normal imprinting and methylation. 

Figure 8 shows preparation of bicistronic agouti and GFP reporter sequences under 
control of the Snrpn promoter. In this instance, the plasmid pIRES-EGFP (Clontech; Palo 
5 Alto, CA) was utilized. The agouti (Ag) cDNA was placed under the control of the Snrpn 
promoter, as for the construct shown in Figure 1 . This was followed by an internal ribosome 
entry site (IRES) and a cassette encoding enhanced green fluorescent protein (EGFP). A neo 
resistance cassette was used to introduce the construct into the germline, and the Cre recombinase 
was then used to delete the neo cassette, by standard methods in the art. This yields a bicistronic 

1 0 reporter cassette expressing agouti and EGFP. This construct is then introduced into mouse 
ES cells and is subsequently injected for germline transmission. 

Figure 9 illustrates the generation of two fusion gene constructs expressing human 
a- fetoprotein under control of the promoter for the LDL receptor (abbreviations are as follows : 
S A, splicing acceptor; pA, S V40 polyA signal). In the upper construct depicted in panel A, 

15 also referred to as the 5' targeting construct, the human a-fetoprotein (hAFP) is expressed 
using an IRES with the construct introduced in the first intron of the LDL receptor locus. 
In the lower construct in panel A, also referred to as the 3' targeting construct, the expression 
cassette is introduced in the 3' untranslated region of the LDL receptor locus. In the case of 
the 5' -targeting construct, exon 1 is spliced to the IRES, and hAFP is then expressed under 

20 the control of the promoter for the LDL receptor. In the case of the 3' -targeting construct, 
the gene is spliced in the usual mode and is expressed with production of protein from the 
LDL receptor. The last exon is modified to include the IRES and hAFP and gives rise to 
expression from the placement in the 3' -UTR. With the 5'-targeting construct the gene for 
the LDL receptor is disrupted, while it retains a functional manner with the 3 '-targeting construct. 

25 Panel B of Figure 9 demonstrates detectable hAFP in the tissue culture medium for 

each of the two constructs. The 5'- and 3'-targeting constructs were electroporated into mouse 
ES cells, and clones were screened for homologous recombination events by standard methods 
in the art. Appropriate clones were identified for both the 5'- and 3'-targeting constructs. 
These clones were then grown in tissue culture medium, and the medium was harvested for 

30 quantitation of hAFP using a commercial ELISA. The wild type ES cells (WT) do not express 
any detectable hAFP as expected. Easily detectable levels of hAFP are found with both the 
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5'-targeting and 3'-targeting constructs at 27 and 50 hours after subculture. The expression 
level is higher with the 5'-targeting construct. The ES cells are injected for transmission to 
the mouse germline, 

5 

EXAMPLE 7 
Bicistronic and more complex reporter sequences 

1 0 Bicistronic (two reporter sequences) reporter sequences containing combinations such 

as agouti coat-color marker and human growth hormone or agouti coat-color marker and green 
fluorescent protein are prepared. A major advantage of a bicistronic reporter sequence is that 
events irrelevant to the expression of the knockin/fusion sequence are easily distinguished 
in that they would ordinarily affect one reporter sequence, but not the other. The simultaneous 

1 5 increase or decrease of expression of both reporter sequences in a bicistronic construct greatly 
facilitates the elimination of uninteresting events from consideration. These bicistronic constructs 
are prepared by homologous recombination in ES cells including putting the agouti coat-color 
marker and GFP, and another construct with the agouti coat-color marker and HGH, under 
the control of the mouse specific nucleotide sequence promoter, for example the Snrpn promoter. 

20 There are other circumstances where it might be advantageous to have multiple secreted proteins 
in a bicistronic marker. In cases such as this, combinations of HGH (SEQ ID NO : 1 4), human- 
AFP (SEQ ID NO: 1 5), rat- AFP (SEQ ID NO: 16) or other secreted proteins are utilized. This 
is particularly relevant to circumstances where a small number of cells in a remote tissue such 
as brain is secreting the marker protein into the bloodstream. It is important that bicistronic 

25 constructs use a separate initiation for protein translation. This is achieved using an internal 
ribosome entry site (IRES) so that the two reporter proteins are not covalently attached and 
one reporter sequence is not secondarily affected posttranslationally by effects on the other 
reporter sequence. In the bicistronic constructs for analysis of transcriptional regulation, the 
promoter of a specific nucleotide sequence of interest is linked to the bicistronic reporter construct 

30 using homologous recombination in ES cells or in any other cultured mammalian or other 
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cell system or cell extract. Further redundancy is introduced into the system by preparation 
of more complex tricistronic reporter sequences. 

EXAMPLE 8 

5 

Transcriptional regulation. 

For examination of transcriptional regulation a promoter of interest is used and the 
reporter sequence is placed under the control of the promoter of interest. Reporter sequences 

10 of various types are placed under the control of promoters other than Snrpn to demonstrate 
the generality of this approach. It is well known that regulatory elements can be located near 
the promoter or at great distances upstream or downstream, including within large introns. 
In such instances, constructs are designed that mutate the initiation ATG codon and place 
the reporter sequence further downstream in the gene so that regulated expression of the reporter 

15 sequence is achieved. This is tested with the agouti reporter sequence similar to that seen 
in the Examples above. One skilled in the art recognizes that a variety of strategies can be 
used to preserve normal transcriptional control while placing the reporter sequence under 
the control of the promoter of interest. 

20 EXAMPLE 9 

Posttranscriptional regulation 

The knockin/fusion sequence strategy is also used to study certain types of 
25 posttranscriptional regulation. The reporter sequence is fused to the N-terminal or C-terminal 
portion of a protein to give a hybrid protein. Under these circumstances, a protein that is subject 
to degradation by a ubiquitin pathway, for example, can have its half-life determined by that 
mechanism, and the half-life of the fused reporter protein is similarly affected. The 
knockin/fusion sequences of this type are used to search for posttranscriptional effects. The 
30 constructs are introduced by homologous recombination into ES cells and introduced into 
the germline. Constructs are also introduced into other cell types by homologous recombination 
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or introduced as extra chromosomal molecules. In the case of posttranscriptional regulatory 
effects, other antigenic tags can be fused to proteins to facilitate quantitation. This invention 
combines the use of these protein fusions to screen for genetic effects on posttranscriptional 
control or biologic affects of compounds on posttranscriptional control. 

5 

EXAMPLE 10 

Potential deleterious effects of knockin/fusion sequences on phenotype 

1 0 The introduction of the agouti cDNA under the control of the Snrpn promoter potentially 

inactivates the Snrpn gene. Since it was already known that homozygous knockout of the 
Snrpn gene did not give an obvious mouse phenotype, it was possible to breed animals either 
heterozygous or homozygous for the knockin/fusion sequence. One skilled in the art recognizes 
that in many instances, the knockin/fusion sequence is only useful in the heterozygous state 

1 5 because the homozygous inactivation of the specific nucleotide sequence of interest is lethal 
or causes other phenotypic abnormalities. Further, the skilled artisan recognizes that even 
a heterozygous knockin/fusion sequence introduced by homologous recombination can have 
a deleterious or even lethal effect. In these instances, constructs are made in which the reporter 
sequence is introduced with an IRES restart in the 3' untranslated portion of the specific 

20 nucleotide sequence of interest. Using this strategy, any nucleotide sequence can be analyzed 
using the proposed methodology. Introduction of the reporter sequence into the 3' untranslated 
portion of the specific nucleotide sequence of interest using an IRES restart can be the preferred 
strategy. 

25 EXAMPLE 11 

Biological effects of compounds 

As has already been demonstrated using 5-azacytidine, it is possible to screen for effects 
30 of compounds on sequence expression in this system. Various compounds with the potential 
to regulate transcription can be tested with the methods of the present invention. These include, 
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but are not limited to, antisense nucleic acids, small xenotrophic molecules, and known biological 
agents occurring naturally in cells. Screening of compounds is carried out in vivo with intact 
animals such as mice or in vitro in cells, tissues, organs or tissue slices. This approach was 
used in the administration of azacytidine to intact animals. Tissue culture cells carrying the 
5 knockin/fusion sequence on the maternal chromosome with it methylated and silenced are 
available. With a bicistronic reporter sequence including agouti cDNA and human growth 
hormone, the secretion of human growth hormone is induced by treatment of the fibroblast 
cultures with 5 -azacytidine. This demonstrates the feasibility of screening compounds for 
biological effects either in vivo or in vitro in tissue culture. Although large numbers of 

1 0 compounds can be administered to animals such as mice to examine for obvious effects on 
coat color or increases or decreases in the blood level of HGH, tissue culture allows high- 
throughput screening. For example, 96 well tissue culture dishes can be used to screen multiple 
compounds easily and quickly with the methods described herein. 

In the instance of drugs, bicistronic tests are available. For example, a bicistronic 

1 5 message expressing HGH and GFP is placed under the control of the utrophin promoter (SEQ 
ID NO: 17). There is ample evidence that increased expression of utrophin in muscle can 
provide a cure for Duchenne muscular dystrophy, where dystrophin is genetically defective 
or absent. A large number of compounds are screened in intact mice and/or in cultured cells 
derived from such mice. The compounds that induce increased expression of utrophin can 

20 cause increased levels of growth hormone in the blood or tissue culture medium and thus 
induce the histological detectability of GFP. This is an example representing a search for 
upregulators of gene expression. 

A bicistronic reporter expressing HGH and hAFP under the control of the amyloid 
A beta precursor protein (APP) (SEQ ID NO: 1 8) promoter is tested. An alternative embodiment 

25 is placement of the bicistronic reporter sequence under the control of the promoter for the 
presenilin-1 sequence (SEQ ID NO:19) or presenilin-2 sequence (SEQ ID NO:20), which 
likely interfere with processing of APP, or the a-secretase (SEQ ID NO :2 1 ) that cleaves amyloid 
A in a manner that promotes Alzheimer disease. These genes are expressed at detectable levels 
in tissue culture cells. Again, intact animals such as mice or cell cultures derived from humans 

30 are tested. With the bicistronic HGH and hAFP reporter sequences, compounds that increase 
the expression of either of these genes show increased levels of HGH and hAFP in blood or 
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tissue culture medium and drugs that decrease expression show decreased levels of HGH and 
hAFP in blood or tissue culture medium. These two genes are tested separately seeking decreased 
expression as a preventive approach in Alzheimer disease. 



EXAMPLE 12 



Additional Specific Embodiments 



The methods described herein may be adapted to other sequences, promoters, cell 
10 lines, systems, etc. in accordance with the teachings provided herein. Additional specific 
embodiments include a mouse cell line for screening drugs which affect the LDL receptor 
(SEQ ID NO:22). Alternatively, apolipoprotein A-I (SEQ ID NO:23) is utilized as anucleic 
acid sequence of interest, wherein a system is created to screen for drugs that upregulate 
expression of this nucleic acid, which increases the level of HDL, or beneficial cholesterol. 
15 The current lack in the art of effective drugs to raise HDL cholesterol makes this system a 
valuable addition to the field. 

In another embodiment a system is created in accordance with the methods described 
herein to assay for drugs which regulate expression of genes implicated in pathogenesis of 
Alzheimers or other neurodegenerative diseases. For example, it is desirable to identify drugs 
20 which downregulate the expression of the amyloid beta A4 precursor protein (SEQ ID NO: 1 8) 
on human chromosome 2 1 . This gene is strongly implicated in the pathogenesis of Alzheimer's 
and it is thought that three copies of the gene in trisomy 21 contribute to early onset of 
Alzheimer' s disease in that particular medical condition. Other genes of interest would include 
presenilin-1 (SEQ ID NO:19) and presenilin-2 (SEQ ID NO:20). 

25 
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One skilled in the art readily appreciates that the present invention is well adapted 
to carry out the objectives and obtain the ends and advantages mentioned as well as those 
inherent therein. Systems, pharmaceutical compositions, treatments, methods, procedures 
and techniques described herein are presently representative of the preferred embodiments 
5 and are intended to be exemplary and are not intended as limitations of the scope. Changes 
therein and other uses will occur to those skilled in the art which are encompassed within 
the spirit of the invention or defined by the scope of the pending claims. 
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What we claim: 

1 . A system to determine elements that regulate expression of anucleotide sequence 
5 comprising a reporter sequence inserted within said nucleotide sequence under conditions 

wherein the expression of the reporter sequence is controlled by the promoter for said nucleotide 
sequence. 

2. The system of Claim 1 wherein the nucleotide sequence is selected from the 
1 0 group of organisms consisting of mammals, non-mammalian animals, plants, insects, aquatic 

organisms and avian species. 

3 . The system of Claim 1 wherein two different reporter sequences are inserted 
within the nucleotide sequence and both controlled by said promoter. 

15 

4. The system of Claim 1 wherein the reporter sequence is selected from the group 
consisting of the agouti gene, ocMSH, a-fetoprotein, p-galactosidase, p-glucuronidase, 
chloramphenicol acetyltransferase (CAT), green fluorescent protein (GFP), human growth 
hormone, luciferase, and epitope tag. 

20 

5 . As a composition of matter a mammal having a functional system of Claim 
1 inserted into its genome. 
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6 . As a composition of matter a non-mammalian animal having a functional system 
of Claim 1 inserted into its genome. 



7 . As a composition of matter a plant having a functional system of Claim 1 inserted 
5 into its genome. 

8. As a composition of matter an insect having a functional system of Claim 1 
inserted into its genome. 

10 9. As a composition of matter an aquatic organism having a functional system 

of Claim 1 inserted into its genome. 

10. As a composition of matter an avian species having a functional system of 
Claim 1 inserted into its genome. 

15 

11. As a composition of matter a cell line or cell free extract having a functional 
system of Claim 1 inserted into the cell. 

12. A method for identifying an upregulator or downregulator of expression for 
20 a specific nucleotide sequence, comprising the steps of: 

inserting a test system into a test organism, wherein the test system 
comprises a reporter sequence inserted into the specific nucleotide sequence under conditions 
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wherein the expression of the reporter sequence is controlled by the promoter for the specific 
nucleotide sequence; 

delivering a compound to the test organism containing the test system; 

and 

5 measuring the effect of said compound on said reporter sequence, wherein 

when said compound increases the expression of said reporter sequence, the compound is 
an upregulator and when said compound decreases the expression of said reporter sequence, 
the compound is a downregulator. 

10 13. The method of Claim 1 2 wherein the reporter sequence is selected from the 

group consisting of the agouti gene, ocMSH, a-fetoprotein, p-galactosidase, p-glucuronidase, 
chloramphenicol acetyltransferase (CAT), green fluorescent protein (GFP), human growth 
hormone, luciferase, and epitope tag. 

15 14. The method of Claim 1 2 wherein the test organism is selected from the group 

consisting of a mammal, a non-mammalian animal, a plant, an insect, an aquatic organism 
and an avian species. 

1 5 . The method of Claim 12, wherein the test organism is a cell line, cell free extract, 
20 tissue, tissue slice or organ. 

16. A method for identifying an upregulator or downregulator of expression of 
a specific nucleotide sequence comprising the steps of: 



74 



WO 00/79264 PCT/US00/17493 

inserting a test system into a test organism, wherein the test system 
comprises a reporter sequence inserted within the specific nucleotide sequence under conditions 
wherein the expression of the reporter sequence is controlled by the promoter for the specific 
nucleotide sequence; 

5 mutagenizing the test organism containing the test system; and 

measuring the effect of said mutagenesis on expression of said reporter 
sequence, wherein when the mutant resulting from said mutagenizing increases the expression 
of said reporter sequence, the mutant is an upregulator and when the mutant resulting from 
said mutagenizing decreases the expression of said reporter sequence it is a downregulator. 

10 

17. The method of Claim 16 further comprising the steps of: 

delivering a compound to the mutagenized test organism; and 
measuring the effect of said compound on said reporter sequence, wherein 
when said compound increases the expression of said reporter sequence it is an upregulator 
1 5 and when said compound decreases the expression of said reporter sequence it is a downregulator. 

18. The method of Claim 12 or 16 wherein two different reporter sequences are 
inserted within the nucleotide sequence and both are under the control of said promoter. 

20 19. The method of Claim 1 6 wherein the test organism is selected from the group 

consisting of a non-human mammal, a non-mammalian animal, a plant, an insect, an aquatic 
organism and an avian species. 
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20. The method of Claim 1 6, wherein the test organism is a cell line, cell free extract, 
tissue, tissue slice or organ. 



21. A method of identifying an upregulator to treat a disease in which there is a 
5 deficiency in expression of a nucleotide sequence comprising the method of Claim 12 or 1 6. 

22 . The method of Claim 2 1 wherein the disease in Duchenne Muscular Dystrophy 
and the nucleotide sequence is the sequence coding for utrophin. 

10 23 . A method of identifying a downregulator to treat a disease in which there is 

an excess in expression of a nucleotide sequence comprising the method of Claim 12 or 16. 

24. The method of claim of 23 where disease is Alzheimer's and the nucleotide 
sequence is the sequence coding for amyloid A beta precursor protein or presinilin-1 . 

15 

25 . A method of treating a disease in an organism in which the disease is the result 
of deficiency in expression of a nucleotide sequence comprising the step of introducing a 
pharmacologically effective dose of an upregulator identified in the method of Claim 12 or 
16. 

20 

26 . A method of treating a disease in an organism in which the disease is the result 
of an excess in expression of a nucleotide sequence comprising the step of introducing a 
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pharmacologically effective dose of a downregulator identified in the method of Claim 12 
or 16. 



27. A method to identify an upregulator or downregulator to enhance the food 
5 content of an animal, aquatic organism, avian species or plant comprising the method of Claim 

12 or 16. 

28. The system of Claim 1, wherein the reporter sequence is introduced into the 
3' untranslated portion of the nucleotide sequence. 

10 

29. The method of Claim 12 for identifying compounds that effect posttranscriptional 
control at the RNA level, wherein the reporter is inserted by homologous recombination or 
extrachromosomal nucleic acid, and the resulting construct expresses a fusion transcript. 

15 30. The method of Claim 1 6 for identifying mutant that effects posttranscriptional 

control at the RNA level wherein the reporter is inserted by homologous recombination or 
extrachromosomal nucleic acid, and the resulting construct expresses a fusion transcript. 

3 1 . The method of Claim 1 2 for identifying compounds that effect posttranscriptional 
20 control at the protein level by inserting the reporter sequence into the nucleotide sequence 
by homologous recombination or extrachromosomal nucleic acid, wherein such construct 
expresses a fusion protein. 
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32 . The method of Claim 1 6 for identifying mutant that effect posttranscriptional 
control at the protein level by inserting the reporter sequence into the nucleotide sequence 
by homologous recombination or extrachromosomal nucleic acid, wherein such construct 
expresses a fusion protein. 

5 

3 3 . The system of Claim 1 , wherein the reporter sequence is introduced into the 
nucleotide sequence at a location that expresses the N-terminal portion of a protein. 

34. The system of Claim 1 , wherein the reporter sequence is introduced into the 
1 0 nucleotide sequence at a location that expresses the C-terminal portion of a protein. 

35. A method of treating a disease or condition in an organism by introducing 
into the organism a pharmacologically effective dose of an upregulator identified in the method 
of Claim 12 or 16. 

15 

36. The method of Claim 35, wherein the upregulator increases the expression 
of the nucleotide sequence coding for the LDL receptor. 

37. The method of Claim 35, wherein the upregulator increases the expression 
20 of the nucleotide sequence coding for apolipoprotein A-I. 

38. The method of Claim 35, wherein the upregulator increases the expression 
of the nucleotide sequence coding for a tumor suppressor gene. 
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39. The method of Claim 1 , wherein the upregulator increases the expression of 
the nucleotide sequence coding for growth hormone. 



40. A method of treating a disease or condition in an organism by introducing 
5 into the organism a pharmacologically effective dose of a downregulator identified in the 
method of Claim 12 or 16. 
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